Jan. 17, 2022, 2:10 a.m. | Ted Moskovitz, Jack Parker-Holder, Aldo Pacchiano, Michael Arbel, Michael I. Jordan

cs.LG updates on arXiv.org

In recent years, deep off-policy actor-critic algorithms have become a dominant approach to reinforcement learning for continuous control. One of the primary drivers of this improved performance is the use of pessimistic value updates to address function approximation errors, which previously led to disappointing performance. However, a direct consequence of pessimism is reduced exploration, running counter to theoretical support for the efficacy of optimism in the face of uncertainty. So which approach is best? In this work, we show that …
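
The abstract is truncated before the paper's answer, but the tension it sets up is concrete: pessimistic value updates (for example, bootstrapping from the minimum of two critics, as in clipped double Q-learning) suppress the value overestimation that hurt earlier deep actor-critic methods, while optimism in the face of uncertainty instead adds an exploration bonus scaled by uncertainty. The sketch below is not the paper's method, only a minimal illustration of the two target rules; the function value_target, the disagreement-based uncertainty proxy, and the optimism coefficient beta are all hypothetical names chosen for this example.

import torch

def value_target(q1, q2, mode="pessimistic", beta=0.5):
    """Combine two critic estimates into one bootstrap target.

    q1, q2: Q-value estimates for the same (state, action) pairs.
    mode:   "pessimistic" takes the elementwise minimum, as in
            clipped double Q-learning; "optimistic" adds a bonus
            proportional to the critics' disagreement.
    beta:   hypothetical optimism coefficient (not from the paper).
    """
    mean = (q1 + q2) / 2
    spread = (q1 - q2).abs() / 2  # disagreement as a crude uncertainty proxy
    if mode == "pessimistic":
        return torch.min(q1, q2)   # lower-confidence-style target
    return mean + beta * spread    # upper-confidence-style target

# Toy usage: two critics disagree about the value of two actions.
q1 = torch.tensor([1.0, 2.0])
q2 = torch.tensor([1.5, 1.0])
print(value_target(q1, q2, "pessimistic"))  # tensor([1.0000, 1.0000])
print(value_target(q1, q2, "optimistic"))   # tensor([1.3750, 1.7500])

Under the pessimistic rule the second action looks no better than the first, so the agent has little incentive to explore it; the optimistic rule rewards exactly the actions the critics disagree about. The paper's question, per the abstract, is which of these two regimes is best.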

arxiv learning optimism reinforcement learning
