The Power of Resets in Online Reinforcement Learning | allainews.com

April 25, 2024, 7:42 p.m. | Zakaria Mhammedi, Dylan J. Foster, Alexander Rakhlin

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.15417v1 Announce Type: new
Abstract: Simulators are a pervasive tool in reinforcement learning, but most existing algorithms cannot efficiently exploit simulator access -- particularly in high-dimensional domains that require general function approximation. We explore the power of simulators through online reinforcement learning with {local simulator access} (or, local planning), an RL protocol where the agent is allowed to reset to previously observed states and follow their dynamics during training. We use local simulator access to unlock new statistical guarantees that …

abstract access algorithms approximation arxiv cs.ai cs.lg domains exploit explore function general online reinforcement learning planning power protocol reinforcement reinforcement learning simulator stat.ml through tool type

More from arxiv.org / cs.LG updates on arXiv.org

LangProp: A code optimization framework using Large Language Models applied to driving 20 hours ago | arxiv.org

arxiv code cs.ai cs.lg +10

MRI Scan Synthesis Methods based on Clustering and Pix2Pix 20 hours ago | arxiv.org

abstract arxiv automated brain +16

Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters 20 hours ago | arxiv.org

abstract arxiv concept concepts +21

Improving Interpretation Faithfulness for Vision Transformers 20 hours ago | arxiv.org

abstract adversarial adversarial attacks architectures +21

Training robust and generalizable quantum models 20 hours ago | arxiv.org

abstract adversarial arxiv context +15

Causal Discovery Under Local Privacy 20 hours ago | arxiv.org

abstract application arxiv causal +19

From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks 20 hours ago | arxiv.org

abstract act arxiv concepts +13

It's About Time: Temporal References in Emergent Communication 20 hours ago | arxiv.org

abstract agents arxiv autonomous +21

Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning 20 hours ago | arxiv.org

arxiv cs.lg cs.ro reinforcement +3

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Alternance DATA/AI Engineer (H/F)

@ SQLI | Le Grand-Quevilly, France

View on ai-jobs.net