July 6, 2022, 1:11 a.m. | Hang Lai, Jian Shen, Weinan Zhang, Yimin Huang, Xing Zhang, Ruiming Tang, Yong Yu, Zhenguo Li

cs.LG updates on arXiv.org arxiv.org

Model-based reinforcement learning has attracted wide attention due to its
superior sample efficiency. Despite its impressive success so far, it is still
unclear how to appropriately schedule the important hyperparameters to achieve
adequate performance, such as the real data ratio for policy optimization in
Dyna-style model-based algorithms. In this paper, we first theoretically
analyze the role of real data in policy training, which suggests that gradually
increasing the ratio of real data yields better performance. Inspired by the
analysis, we …

arxiv learning lg reinforcement reinforcement learning scheduling

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Praktikum im Bereich eMobility / Charging Solutions - Data Analysis

@ Bosch Group | Stuttgart, Germany

Business Data Analyst

@ PartnerRe | Toronto, ON, Canada

Machine Learning/DevOps Engineer II

@ Extend | Remote, United States

Business Intelligence Developer, Marketing team (Bangkok based, relocation provided)

@ Agoda | Bangkok (Central World)