Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems. (arXiv:2206.12020v1 [cs.LG]) | allainews.com

June 27, 2022, 1:11 a.m. | Masatoshi Uehara, Ayush Sekhari, Jason D. Lee, Nathan Kallus, Wen Sun

stat.ML updates on arXiv.org arxiv.org

We study Reinforcement Learning for partially observable dynamical systems
using function approximation. We propose a new \textit{Partially Observable
Bilinear Actor-Critic framework}, that is general enough to include models such
as observable tabular Partially Observable Markov Decision Processes (POMDPs),
observable Linear-Quadratic-Gaussian (LQG), Predictive State Representations
(PSRs), as well as a newly introduced model Hilbert Space Embeddings of POMDPs
and observable POMDPs with latent low-rank transition. Under this framework, we
propose an actor-critic style algorithm that is capable of performing agnostic
policy …

arxiv learning lg observable reinforcement reinforcement learning systems

More from arxiv.org / stat.ML updates on arXiv.org

Simultaneous upper and lower bounds of American option prices with hedging via neural networks 14 hours ago | arxiv.org

abstract arxiv form math.pr +11

Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF 1 day, 14 hours ago | arxiv.org

accounting arxiv context cs.ai +6

Hacking Task Confounder in Meta-Learning 1 day, 14 hours ago | arxiv.org

abstract arxiv cs.lg hacking +12

Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case 1 day, 14 hours ago | arxiv.org

abstract algorithms arxiv case +10

Provable Reward-Agnostic Preference-Based Reinforcement Learning 1 day, 14 hours ago | arxiv.org

abstract agent arxiv cs.ai +16

Mastering Diverse Domains through World Models 1 day, 14 hours ago | arxiv.org

abstract algorithm algorithms application +22

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models 1 day, 14 hours ago | arxiv.org

abstract arxiv cs.it cs.lg +14

Additive Covariance Matrix Models: Modelling Regional Electricity Net-Demand in Great Britain 1 day, 14 hours ago | arxiv.org

abstract arxiv britain consumption +18

Learning Algorithm Generalization Error Bounds via Auxiliary Distributions 1 day, 14 hours ago | arxiv.org

abstract algorithm arxiv cs.it +16

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

IT Commercial Data Analyst - ESO

@ National Grid | Warwick, GB, CV34 6DA

View on ai-jobs.net

Stagiaire Data Analyst – Banque Privée - Juillet 2024

@ Rothschild & Co | Paris (Messine-29)

View on ai-jobs.net

Operations Research Scientist I - Network Optimization Focus

@ CSX | Jacksonville, FL, United States

View on ai-jobs.net

Machine Learning Operations Engineer

@ Intellectsoft | Baku, Baku, Azerbaijan - Remote

View on ai-jobs.net

Data Analyst

@ Health Care Service Corporation | Richardson Texas HQ (1001 E. Lookout Drive)

View on ai-jobs.net