SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning. (arXiv:2205.13528v3 [cs.LG] UPDATED) | allainews.com

Sept. 1, 2022, 1:11 a.m. | Marco Bagatella, Sammy Christen, Otmar Hilliges

cs.LG updates on arXiv.org arxiv.org

Efficient exploration is a crucial challenge in deep reinforcement learning.
Several methods, such as behavioral priors, are able to leverage offline data
in order to efficiently accelerate reinforcement learning on complex tasks.
However, if the task at hand deviates excessively from the demonstrated task,
the effectiveness of such methods is limited. In our work, we propose to learn
features from offline data that are shared by a more diverse range of tasks,
such as correlation between actions and directedness. Therefore, …

arxiv exploration free learning policy reinforcement reinforcement learning state

More from arxiv.org / cs.LG updates on arXiv.org

Discovering Nuclear Models from Symbolic Machine Learning 20 hours ago | arxiv.org

abstract arxiv behavior challenge +12

Advancing Network Intrusion Detection: Integrating Graph Neural Networks with Scattering Transform and Node2Vec for Enhanced … 20 hours ago | arxiv.org

abstract analysis anomaly anomaly detection +19

A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection 20 hours ago | arxiv.org

arxiv closer look covid covid-19 +9

RELIANCE: Reliable Ensemble Learning for Information and News Credibility Evaluation 20 hours ago | arxiv.org

abstract arxiv challenge cs.cl +19

Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack 20 hours ago | arxiv.org

abstract adversarial artists artwork +18

GestaltMML: Enhancing Rare Genetic Disease Diagnosis through Multimodal Machine Learning Combining Facial Images and Clinical … 20 hours ago | arxiv.org

abstract arxiv clinical cs.cv +19

Isolated pulsar population synthesis with simulation-based inference 20 hours ago | arxiv.org

abstract arxiv astro-ph.he astro-ph.im +15

Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming 20 hours ago | arxiv.org

abstract advanced applications arxiv +27

Training of Neural Networks with Uncertain Data -- A Mixture of Experts Approach 20 hours ago | arxiv.org

abstract arxiv cs.lg data +17

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Computer Vision Engineer

@ Motive | Pakistan - Remote

View on ai-jobs.net

Data Analyst III

@ Fanatics | New York City, United States

View on ai-jobs.net

Senior Data Scientist - Experian Health (This role is remote, from anywhere in the U.S.)

@ Experian | ., ., United States

View on ai-jobs.net

Senior Data Engineer

@ Springer Nature Group | Pune, IN

View on ai-jobs.net