Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments. (arXiv:2205.07015v3 [cs.LG] UPDATED)

Sept. 22, 2022, 1:12 a.m. | Ryan Sullivan, J. K. Terry, Benjamin Black, John P. Dickerson

cs.LG updates on arXiv.org

Visualizing optimization landscapes has led to many fundamental insights in
numeric optimization, and novel improvements to optimization techniques.
However, visualizations of the objective that reinforcement learning optimizes
(the "reward surface") have only ever been generated for a small number of
narrow contexts. This work presents reward surfaces and related visualizations
of 27 of the most widely used reinforcement learning environments in Gym for
the first time. We also explore reward surfaces in the policy gradient
direction and show for the …
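
To make the idea of a reward surface concrete, the sketch below evaluates the return of a policy on a 2-D grid of parameter perturbations around a starting point. It is only an illustration under stated assumptions, not the authors' procedure: the toy 1-D environment, the linear policy, the names `episode_return` and `reward_surface`, and the choice of random unit directions are all hypothetical and do not come from the paper.

```python
import numpy as np


def episode_return(theta, n_steps=50):
    """Return of a deterministic toy 1-D task (hypothetical, not a Gym environment).

    The state x starts at 1.0, the linear policy picks a clipped action,
    and the per-step reward is -x**2, so higher return means x stays near 0.
    """
    x = 1.0
    total = 0.0
    for _ in range(n_steps):
        action = float(np.clip(theta[0] * x + theta[1], -1.0, 1.0))
        x = x + 0.1 * action
        total += -x * x
    return total


def reward_surface(theta, d1, d2, radius=1.0, resolution=11):
    """Evaluate the return at theta + a*d1 + b*d2 over a square grid of (a, b)."""
    alphas = np.linspace(-radius, radius, resolution)
    surface = np.empty((resolution, resolution))
    for i, a in enumerate(alphas):
        for j, b in enumerate(alphas):
            surface[i, j] = episode_return(theta + a * d1 + b * d2)
    return alphas, surface


# Assumed setup: a 2-parameter policy and two random unit directions.
rng = np.random.default_rng(0)
theta = np.array([-1.0, 0.0])
d1 = rng.normal(size=2)
d2 = rng.normal(size=2)
d1 /= np.linalg.norm(d1)
d2 /= np.linalg.norm(d2)

alphas, surface = reward_surface(theta, d1, d2)
```

The resulting `surface` array can be passed to any contour or 3-D plotting routine. With a stochastic environment, each grid point would need an average over several episodes rather than a single rollout.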


