all AI news
Representation Learning for Online and Offline RL in Low-rank MDPs. (arXiv:2110.04652v3 [cs.LG] UPDATED)
Jan. 7, 2022, 2:10 a.m. | Masatoshi Uehara, Xuezhou Zhang, Wen Sun
cs.LG updates on arXiv.org arxiv.org
This work studies the question of Representation Learning in RL: how can we
learn a compact low-dimensional representation such that on top of the
representation we can perform RL procedures such as exploration and
exploitation, in a sample efficient manner. We focus on the low-rank Markov
Decision Processes (MDPs) where the transition dynamics correspond to a
low-rank transition matrix. Unlike prior works that assume the representation
is known (e.g., linear MDPs), here we need to learn the representation for the …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Vice President, Data Science, Marketplace
@ Xometry | North Bethesda, Maryland, Lexington, KY, Remote
Field Solutions Developer IV, Generative AI, Google Cloud
@ Google | Toronto, ON, Canada; Atlanta, GA, USA