On Effective Scheduling of Model-based Reinforcement Learning. (arXiv:2111.08550v3 [cs.LG] UPDATED) | allainews.com

July 6, 2022, 1:11 a.m. | Hang Lai, Jian Shen, Weinan Zhang, Yimin Huang, Xing Zhang, Ruiming Tang, Yong Yu, Zhenguo Li

cs.LG updates on arXiv.org arxiv.org

Model-based reinforcement learning has attracted wide attention due to its
superior sample efficiency. Despite its impressive success so far, it is still
unclear how to appropriately schedule the important hyperparameters to achieve
adequate performance, such as the real data ratio for policy optimization in
Dyna-style model-based algorithms. In this paper, we first theoretically
analyze the role of real data in policy training, which suggests that gradually
increasing the ratio of real data yields better performance. Inspired by the
analysis, we …

arxiv learning lg reinforcement reinforcement learning scheduling

More from arxiv.org / cs.LG updates on arXiv.org

Discovering Nuclear Models from Symbolic Machine Learning 7 hours ago | arxiv.org

abstract arxiv behavior challenge +12

Advancing Network Intrusion Detection: Integrating Graph Neural Networks with Scattering Transform and Node2Vec for Enhanced … 7 hours ago | arxiv.org

abstract analysis anomaly anomaly detection +19

A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection 7 hours ago | arxiv.org

arxiv closer look covid covid-19 +9

RELIANCE: Reliable Ensemble Learning for Information and News Credibility Evaluation 7 hours ago | arxiv.org

abstract arxiv challenge cs.cl +19

Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack 7 hours ago | arxiv.org

abstract adversarial artists artwork +18

GestaltMML: Enhancing Rare Genetic Disease Diagnosis through Multimodal Machine Learning Combining Facial Images and Clinical … 7 hours ago | arxiv.org

abstract arxiv clinical cs.cv +19

Isolated pulsar population synthesis with simulation-based inference 7 hours ago | arxiv.org

abstract arxiv astro-ph.he astro-ph.im +15

Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming 7 hours ago | arxiv.org

abstract advanced applications arxiv +27

Training of Neural Networks with Uncertain Data -- A Mixture of Experts Approach 7 hours ago | arxiv.org

abstract arxiv cs.lg data +17

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Praktikum im Bereich eMobility / Charging Solutions - Data Analysis

@ Bosch Group | Stuttgart, Germany

View on ai-jobs.net

Business Data Analyst

@ PartnerRe | Toronto, ON, Canada

View on ai-jobs.net

Machine Learning/DevOps Engineer II

@ Extend | Remote, United States

View on ai-jobs.net

Business Intelligence Developer, Marketing team (Bangkok based, relocation provided)

@ Agoda | Bangkok (Central World)

View on ai-jobs.net