On the Estimation Bias in Double Q-Learning. (arXiv:2109.14419v3 [cs.LG] UPDATED) | allainews.com

Jan. 17, 2022, 2:11 a.m. | Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang

cs.LG updates on arXiv.org arxiv.org

Double Q-learning is a classical method for reducing overestimation bias,
which is caused by taking maximum estimated values in the Bellman operation.
Its variants in the deep Q-learning paradigm have shown great promise in
producing reliable value prediction and improving learning performance.
However, as shown by prior work, double Q-learning is not fully unbiased and
suffers from underestimation bias. In this paper, we show that such
underestimation bias may lead to multiple non-optimal fixed points under an
approximate Bellman operator. …

arxiv bias learning

More from arxiv.org / cs.LG updates on arXiv.org

Stochastic Optimal Control Matching 17 hours ago | arxiv.org

arxiv control cs.lg cs.na +6

Value Approximation for Two-Player General-Sum Differential Games with State Constraints 17 hours ago | arxiv.org

abstract approximation arxiv constraints +20

Can We Edit Multimodal Large Language Models? 17 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +9

XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation 17 hours ago | arxiv.org

ai benchmark arxiv benchmark cs.cv +7

Generalized Schr\"odinger Bridge Matching 17 hours ago | arxiv.org

arxiv bridge cs.lg generalized +3

Tight bounds on Pauli channel learning without entanglement 17 hours ago | arxiv.org

abstract algorithms arxiv cs.it +9

Automated mapping of virtual environments with visual predictive coding 17 hours ago | arxiv.org

abstract access algorithms arxiv +28

Confident Feature Ranking 17 hours ago | arxiv.org

abstract arxiv cs.ai cs.lg +14

Integrated Sensing-Communication-Computation for Edge Artificial Intelligence 17 hours ago | arxiv.org

abstract advanced and edge ai artificial +27

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Applied Scientist, Control Stack, AWS Center for Quantum Computing

@ Amazon.com | Pasadena, California, USA

View on ai-jobs.net

Specialist Marketing with focus on ADAS/AD f/m/d

@ AVL | Graz, AT

View on ai-jobs.net

Machine Learning Engineer, PhD Intern

@ Instacart | United States - Remote

View on ai-jobs.net

Supervisor, Breast Imaging, Prostate Center, Ultrasound

@ University Health Network | Toronto, ON, Canada

View on ai-jobs.net

Senior Manager of Data Science (Recommendation Science)

@ NBCUniversal | New York, NEW YORK, United States

View on ai-jobs.net