Hybrid Reinforcement Learning from Offline Observation Alone | allainews.com

June 12, 2024, 4:46 a.m. | Yuda Song, J. Andrew Bagnell, Aarti Singh

cs.LG updates on arXiv.org arxiv.org

arXiv:2406.07253v1 Announce Type: new
Abstract: We consider the hybrid reinforcement learning setting where the agent has access to both offline data and online interactive access. While Reinforcement Learning (RL) research typically assumes offline data contains complete action, reward and transition information, datasets with only state information (also known as observation-only datasets) are more general, abundant and practical. This motivates our study of the hybrid RL with observation-only offline dataset framework. While the task of competing with the best policy "covered" …

abstract access action agent arxiv cs.lg data datasets hybrid information interactive observation offline reinforcement reinforcement learning research state transition type while

More from arxiv.org / cs.LG updates on arXiv.org

Scientific Machine Learning Based Reduced-Order Models for Plasma Turbulence Simulations 13 hours ago | arxiv.org

abstract arxiv build construction +20

LEDITS++: Limitless Image Editing using Text-to-Image Models 13 hours ago | arxiv.org

abstract aim apply arxiv +22

InterVLS: Interactive Model Understanding and Improvement with Vision-Language Surrogates 13 hours ago | arxiv.org

abstract applications arxiv challenges +22

Multimodal and Force-Matched Imitation Learning with a See-Through Visuotactile Sensor 13 hours ago | arxiv.org

abstract arxiv challenges cs.ai +16

Empathy Detection from Text, Audiovisual, Audio or Physiological Signals: Task Formulations and Machine Learning Methods 13 hours ago | arxiv.org

abstract applications arxiv attention +19

Autoencoder-based Anomaly Detection System for Online Data Quality Monitoring of the CMS Electromagnetic Calorimeter 13 hours ago | arxiv.org

abstract anomaly anomaly detection arxiv +20

Gradient Coding with Iterative Block Leverage Score Sampling 13 hours ago | arxiv.org

abstract arxiv block coding +17

Contextual Dynamic Pricing with Strategic Buyers 13 hours ago | arxiv.org

abstract arxiv behavior consumer +18

On Convex Data-Driven Inverse Optimal Control for Nonlinear, Non-stationary and Stochastic Systems 13 hours ago | arxiv.org

abstract agent arxiv context +19

AI Focused Biochemistry Postdoctoral Fellow

@ Lawrence Berkeley National Lab | Berkeley, CA

View on ai-jobs.net

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

PhD Student AI simulation electric drive (f/m/d)

@ Volkswagen Group | Kassel, DE, 34123

View on ai-jobs.net

AI Privacy Research Lead

@ Leidos | 6314 Remote/Teleworker US

View on ai-jobs.net

Senior Platform System Architect, Silicon

@ Google | New Taipei, Banqiao District, New Taipei City, Taiwan

View on ai-jobs.net

Fabrication Hardware Litho Engineer, Quantum AI

@ Google | Goleta, CA, USA

View on ai-jobs.net