all AI news
ConserWeightive Behavioral Cloning for Reliable Offline Reinforcement Learning. (arXiv:2210.05158v1 [cs.LG])
Oct. 12, 2022, 1:11 a.m. | Tung Nguyen, Qinqing Zheng, Aditya Grover
cs.LG updates on arXiv.org arxiv.org
The goal of offline reinforcement learning (RL) is to learn near-optimal
policies from static logged datasets, thus sidestepping expensive online
interactions. Behavioral cloning (BC) provides a straightforward solution to
offline RL by mimicking offline trajectories via supervised learning. Recent
advances (Chen et al., 2021; Janner et al., 2021; Emmons et al., 2021) have
shown that by conditioning on desired future returns, BC can perform
competitively to their value-based counterparts, while enjoying much more
simplicity and training stability. However, the distribution …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Praktikum im Bereich eMobility / Charging Solutions - Data Analysis
@ Bosch Group | Stuttgart, Germany
Business Data Analyst
@ PartnerRe | Toronto, ON, Canada
Machine Learning/DevOps Engineer II
@ Extend | Remote, United States
Business Intelligence Developer, Marketing team (Bangkok based, relocation provided)
@ Agoda | Bangkok (Central World)