Two-Sample Testing in Reinforcement Learning. (arXiv:2201.08078v1 [cs.LG]) | allainews.com

Jan. 21, 2022, 2:10 a.m. | Martin Waltz, Ostap Okhrin

cs.LG updates on arXiv.org arxiv.org

Value-based reinforcement-learning algorithms have shown strong performances
in games, robotics, and other real-world applications. The most popular
sample-based method is $Q$-Learning. A $Q$-value is the expected return for a
state-action pair when following a particular policy, and the algorithm
subsequently performs updates by adjusting the current $Q$-value towards the
observed reward and the maximum of the $Q$-values of the next state. The
procedure introduces maximization bias, and solutions like Double $Q$-Learning
have been considered. We frame the bias problem statistically …

arxiv learning reinforcement learning testing

More from arxiv.org / cs.LG updates on arXiv.org

Stochastic Optimal Control Matching 1 day, 9 hours ago | arxiv.org

arxiv control cs.lg cs.na +6

Value Approximation for Two-Player General-Sum Differential Games with State Constraints 1 day, 9 hours ago | arxiv.org

abstract approximation arxiv constraints +20

Can We Edit Multimodal Large Language Models? 1 day, 9 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +9

XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation 1 day, 9 hours ago | arxiv.org

ai benchmark arxiv benchmark cs.cv +7

Generalized Schr\"odinger Bridge Matching 1 day, 9 hours ago | arxiv.org

arxiv bridge cs.lg generalized +3

Tight bounds on Pauli channel learning without entanglement 1 day, 9 hours ago | arxiv.org

abstract algorithms arxiv cs.it +9

Automated mapping of virtual environments with visual predictive coding 1 day, 9 hours ago | arxiv.org

abstract access algorithms arxiv +28

Confident Feature Ranking 1 day, 9 hours ago | arxiv.org

abstract arxiv cs.ai cs.lg +14

Integrated Sensing-Communication-Computation for Edge Artificial Intelligence 1 day, 9 hours ago | arxiv.org

abstract advanced and edge ai artificial +27

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Lead Software Engineer - Artificial Intelligence, LLM

@ OpenText | Hyderabad, TG, IN

View on ai-jobs.net

Lead Software Engineer- Python Data Engineer

@ JPMorgan Chase & Co. | GLASGOW, LANARKSHIRE, United Kingdom

View on ai-jobs.net

Data Analyst (m/w/d)

@ Collaboration Betters The World | Berlin, Germany

View on ai-jobs.net

Data Engineer, Quality Assurance

@ Informa Group Plc. | Boulder, CO, United States

View on ai-jobs.net

Director, Data Science - Marketing

@ Dropbox | Remote - Canada

View on ai-jobs.net