Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning. (arXiv:2209.11596v2 [cs.LG] UPDATED)
Sept. 29, 2022, 1:12 a.m. | Kang Xu, Yan Ma, Wei Li
cs.LG updates on arXiv.org arxiv.org
Training a robust policy is critical for deploying policies in real-world
systems and for handling unknown dynamics mismatch across different dynamic systems.
Domain Randomization (DR) is a simple and elegant approach that trains a
conservative policy to cope with different dynamic systems without expert
knowledge of the target system's parameters. However, existing work reveals
that policies trained through DR tend to be over-conservative and perform
poorly in target domains. Our key insight is that dynamic systems with
different parameters provide different …
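To make the Domain Randomization idea concrete, here is a minimal sketch: a hypothetical 1-D point-mass environment whose dynamics parameter (the mass) is resampled every episode, and a policy selected for its average return across those randomized dynamics. The environment, the PD-style controller, and the gain-selection loop are all illustrative assumptions, not the paper's method.

```python
import random

class PointMassEnv:
    """Toy 1-D point mass; `mass` is the randomized dynamics parameter."""
    def __init__(self, mass):
        self.mass = mass
        self.pos = 1.0   # start away from the goal at the origin
        self.vel = 0.0

    def step(self, force, dt=0.1):
        # F = m * a; one Euler integration step toward the origin.
        self.vel += (force / self.mass) * dt
        self.pos += self.vel * dt
        return self.pos, -abs(self.pos)   # reward: closer to 0 is better

def rollout(env, gain, horizon=50):
    """Run a simple PD controller and return the total reward."""
    total = 0.0
    for _ in range(horizon):
        force = -gain * env.pos - 2.0 * env.vel
        _, r = env.step(force)
        total += r
    return total

def train_with_dr(mass_range=(0.5, 2.0),
                  candidates=(0.5, 1.0, 2.0, 4.0),
                  episodes=40, seed=0):
    """The DR step: resample the mass uniformly each episode and pick
    the candidate gain with the best return averaged over dynamics."""
    rng = random.Random(seed)
    masses = [rng.uniform(*mass_range) for _ in range(episodes)]
    scores = {g: sum(rollout(PointMassEnv(m), g) for m in masses)
              for g in candidates}
    return max(scores, key=scores.get)

best_gain = train_with_dr()
print("gain selected under DR:", best_gain)
```

Because the gain is scored against many sampled masses rather than one fixed system, the selected policy tends toward conservative behavior that works across the whole range — the very over-conservatism the abstract says DR can suffer from in any single target domain.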
Tags: arxiv, dynamics, quantification, reinforcement learning