Representation Learning for Online and Offline RL in Low-rank MDPs. (arXiv:2110.04652v3 [cs.LG] UPDATED) | allainews.com

Jan. 7, 2022, 2:10 a.m. | Masatoshi Uehara, Xuezhou Zhang, Wen Sun

cs.LG updates on arXiv.org arxiv.org

This work studies the question of Representation Learning in RL: how can we
learn a compact low-dimensional representation such that on top of the
representation we can perform RL procedures such as exploration and
exploitation, in a sample efficient manner. We focus on the low-rank Markov
Decision Processes (MDPs) where the transition dynamics correspond to a
low-rank transition matrix. Unlike prior works that assume the representation
is known (e.g., linear MDPs), here we need to learn the representation for the …

arxiv learning rl

More from arxiv.org / cs.LG updates on arXiv.org

Improving Intrusion Detection with Domain-Invariant Representation Learning in Latent Space 5 hours ago | arxiv.org

abstract arxiv cs.cr cs.lg +18

Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms 5 hours ago | arxiv.org

algorithms arxiv benchmarking control +11

Partially Observable Stochastic Games with Neural Perception Mechanisms 5 hours ago | arxiv.org

abstract agent agents applications +21

Anytime-valid t-tests and confidence sequences for Gaussian means with unknown variance 5 hours ago | arxiv.org

abstract arxiv confidence cs.lg +10

SEED: Domain-Specific Data Curation With Large Language Models 5 hours ago | arxiv.org

abstract analytics applications arxiv +22

Runtime Stealthy Perception Attacks against DNN-based Adaptive Cruise Control Systems 5 hours ago | arxiv.org

abstract acc arxiv attacks +19

Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation 5 hours ago | arxiv.org

arxiv cpu cs.ai cs.ar +8

Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the … 5 hours ago | arxiv.org

abstract analysis arxiv case +21

Unsupervised Solution Operator Learning for Mean-Field Games via Sampling-Invariant Parametrizations 5 hours ago | arxiv.org

abstract advances arxiv computational +17

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Vice President, Data Science, Marketplace

@ Xometry | North Bethesda, Maryland, Lexington, KY, Remote

View on ai-jobs.net

Field Solutions Developer IV, Generative AI, Google Cloud

@ Google | Toronto, ON, Canada; Atlanta, GA, USA

View on ai-jobs.net