Ask Before You Act: Generalising to Novel Environments by Asking Questions. (arXiv:2209.04665v2 [cs.AI] UPDATED) | allainews.com

Sept. 14, 2022, 1:12 a.m. | Ross Murphy, Sergey Mosesov, Javier Leguina Peral, Thymo ter Doest

cs.LG updates on arXiv.org arxiv.org

Solving temporally-extended tasks is a challenge for most reinforcement
learning (RL) algorithms [arXiv:1906.07343]. We investigate the ability of an
RL agent to learn to ask natural language questions as a tool to understand its
environment and achieve greater generalisation performance in novel,
temporally-extended environments. We do this by endowing this agent with the
ability of asking "yes-no" questions to an all-knowing Oracle. This allows the
agent to obtain guidance regarding the task at hand, while limiting the access …

arxiv environments

More from arxiv.org / cs.LG updates on arXiv.org

Training towards significance with the decorrelated event classifier transformer neural network 2 hours ago | arxiv.org

abstract analysis application arxiv +28

An adaptive standardisation methodology for Day-Ahead electricity price forecasting 2 hours ago | arxiv.org

abstract algorithms arxiv complexity +18

SYNAuG: Exploiting Synthetic Data for Data Imbalance Problems 2 hours ago | arxiv.org

abstract arxiv cs.cv cs.lg +17

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination methods 2 hours ago | arxiv.org

abstract algorithms arxiv augmentation +17

Description-Based Text Similarity 2 hours ago | arxiv.org

abstract arxiv cases cs.cl +14

Improving Gradient Methods via Coordinate Transformations: Applications to Quantum Machine Learning 2 hours ago | arxiv.org

abstract algorithms applications arxiv +13

A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference 2 hours ago | arxiv.org

abstract applications arxiv as-a-service +26

Digital Over-the-Air Federated Learning in Multi-Antenna Systems 2 hours ago | arxiv.org

abstract arxiv communication computation +16

Bagging Provides Assumption-free Stability 2 hours ago | arxiv.org

abstract algorithm arxiv assumptions +15

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Technology Consultant Master Data Management (w/m/d)

@ SAP | Walldorf, DE, 69190

View on ai-jobs.net

Research Engineer, Computer Vision, Google Research

@ Google | Nairobi, Kenya

View on ai-jobs.net