Sept. 14, 2022, 1:12 a.m. | Ross Murphy, Sergey Mosesov, Javier Leguina Peral, Thymo ter Doest

cs.LG updates on arXiv.org arxiv.org

Solving temporally-extended tasks is a challenge for most reinforcement
learning (RL) algorithms [arXiv:1906.07343]. We investigate the ability of an
RL agent to learn to ask natural language questions as a tool to understand its
environment and achieve greater generalisation performance in novel,
temporally-extended environments. We do this by endowing this agent with the
ability of asking "yes-no" questions to an all-knowing Oracle. This allows the
agent to obtain guidance regarding the task at hand, while limiting the access …

arxiv environments

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Technology Consultant Master Data Management (w/m/d)

@ SAP | Walldorf, DE, 69190

Research Engineer, Computer Vision, Google Research

@ Google | Nairobi, Kenya