Sept. 19, 2022, 1:12 a.m. | Hung Le, Thommen Karimpanal George, Majid Abdolshah, Dung Nguyen, Kien Do, Sunil Gupta, Svetha Venkatesh

cs.LG updates on arXiv.org arxiv.org

We introduce a constrained optimization method for policy gradient
reinforcement learning that uses a virtual trust region to regulate each
policy update. In addition to the standard trust region formed by the
proximity to a single old policy, we propose a second trust region built
around a virtual policy that represents a wide range of past policies. We
then constrain the new policy to stay close to this virtual policy, which
is beneficial when the old policy performs poorly. More importantly, …
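The idea in the abstract can be illustrated with a minimal sketch. The helper names, the KL-penalty form of the constraint, and the exponential-moving-average construction of the virtual policy below are all assumptions for illustration, not the paper's actual formulation:

```python
import numpy as np

def kl(p, q):
    # KL divergence between two categorical distributions.
    return float(np.sum(p * np.log(p / q)))

def dual_trust_region_penalty(new_pi, old_pi, virtual_pi,
                              beta_old=1.0, beta_virtual=1.0):
    # Penalize divergence from both the single old policy (the usual
    # trust region) and a virtual policy summarizing past policies.
    return (beta_old * kl(new_pi, old_pi)
            + beta_virtual * kl(new_pi, virtual_pi))

def update_virtual_policy(virtual_pi, old_pi, alpha=0.1):
    # One simple way to maintain a virtual policy: an exponential
    # moving average over past policy distributions (an assumption,
    # not necessarily the paper's construction).
    mixed = (1.0 - alpha) * virtual_pi + alpha * old_pi
    return mixed / mixed.sum()

# Toy usage: categorical policies over three actions.
new_pi = np.array([0.7, 0.2, 0.1])
old_pi = np.array([0.6, 0.3, 0.1])
virtual_pi = np.array([0.5, 0.3, 0.2])

penalty = dual_trust_region_penalty(new_pi, old_pi, virtual_pi)
virtual_pi = update_virtual_policy(virtual_pi, old_pi)
```

In a full policy-gradient objective this penalty term would be subtracted from (or used to clip) the surrogate advantage loss, so that updates that drift far from either reference policy are discouraged.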

arxiv optimization policy trust virtual
