Learning Value Functions from Undirected State-only Experience. (arXiv:2204.12458v1 [cs.LG]) | allainews.com

April 27, 2022, 1:12 a.m. | Matthew Chang, Arjun Gupta, Saurabh Gupta

cs.LG updates on arXiv.org arxiv.org

This paper tackles the problem of learning value functions from undirected
state-only experience (state transitions without action labels i.e. (s,s',r)
tuples). We first theoretically characterize the applicability of Q-learning in
this setting. We show that tabular Q-learning in discrete Markov decision
processes (MDPs) learns the same value function under any arbitrary refinement
of the action space. This theoretical result motivates the design of Latent
Action Q-learning or LAQ, an offline RL method that can learn effective value
functions from state-only …

arxiv experience learning state value

More from arxiv.org / cs.LG updates on arXiv.org

Discovering Nuclear Models from Symbolic Machine Learning 21 hours ago | arxiv.org

abstract arxiv behavior challenge +12

Advancing Network Intrusion Detection: Integrating Graph Neural Networks with Scattering Transform and Node2Vec for Enhanced … 21 hours ago | arxiv.org

abstract analysis anomaly anomaly detection +19

A Closer Look at Spatial-Slice Features Learning for COVID-19 Detection 21 hours ago | arxiv.org

arxiv closer look covid covid-19 +9

RELIANCE: Reliable Ensemble Learning for Information and News Credibility Evaluation 21 hours ago | arxiv.org

abstract arxiv challenge cs.cl +19

Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack 21 hours ago | arxiv.org

abstract adversarial artists artwork +18

GestaltMML: Enhancing Rare Genetic Disease Diagnosis through Multimodal Machine Learning Combining Facial Images and Clinical … 21 hours ago | arxiv.org

abstract arxiv clinical cs.cv +19

Isolated pulsar population synthesis with simulation-based inference 21 hours ago | arxiv.org

abstract arxiv astro-ph.he astro-ph.im +15

Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming 21 hours ago | arxiv.org

abstract advanced applications arxiv +27

Training of Neural Networks with Uncertain Data -- A Mixture of Experts Approach 21 hours ago | arxiv.org

abstract arxiv cs.lg data +17

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Computer Vision Engineer

@ Motive | Pakistan - Remote

View on ai-jobs.net

Data Analyst III

@ Fanatics | New York City, United States

View on ai-jobs.net

Senior Data Scientist - Experian Health (This role is remote, from anywhere in the U.S.)

@ Experian | ., ., United States

View on ai-jobs.net

Senior Data Engineer

@ Springer Nature Group | Pune, IN

View on ai-jobs.net