all AI news
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning. (arXiv:2205.13528v3 [cs.LG] UPDATED)
Sept. 1, 2022, 1:11 a.m. | Marco Bagatella, Sammy Christen, Otmar Hilliges
cs.LG updates on arXiv.org arxiv.org
Efficient exploration is a crucial challenge in deep reinforcement learning.
Several methods, such as behavioral priors, are able to leverage offline data
in order to efficiently accelerate reinforcement learning on complex tasks.
However, if the task at hand deviates excessively from the demonstrated task,
the effectiveness of such methods is limited. In our work, we propose to learn
features from offline data that are shared by a more diverse range of tasks,
such as correlation between actions and directedness. Therefore, …
arxiv exploration free learning policy reinforcement reinforcement learning state
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior Computer Vision Engineer
@ Motive | Pakistan - Remote
Data Analyst III
@ Fanatics | New York City, United States
Senior Data Scientist - Experian Health (This role is remote, from anywhere in the U.S.)
@ Experian | ., ., United States
Senior Data Engineer
@ Springer Nature Group | Pune, IN