Off-Policy Risk Assessment in Markov Decision Processes. (arXiv:2209.10444v1 [cs.LG])

Sept. 22, 2022, 1:11 a.m. | Audrey Huang, Liu Leqi, Zachary Chase Lipton, Kamyar Azizzadenesheli

cs.LG updates on arXiv.org

Addressing such diverse ends as safety, alignment with human preferences, and
the efficiency of learning, a growing line of reinforcement learning research
focuses on risk functionals that depend on the entire distribution of returns.
Recent work on off-policy risk assessment (OPRA) for contextual bandits
introduced consistent estimators for the target policy's CDF of returns, along
with finite-sample guarantees that extend to (and hold simultaneously over) all
risk functionals. In this paper, we lift OPRA to Markov decision processes (MDPs), where …
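
To make the idea of estimating a target policy's CDF of returns concrete, here is a minimal sketch for the contextual bandit setting. The excerpt does not give the form of the estimators, so the importance-weighted estimator below is an assumption chosen for illustration, not the authors' method. The names `is_cdf` and `plug_in_quantile` and the toy data are hypothetical.

```python
def is_cdf(returns, weights, t):
    """Importance-weighted estimate of P(return <= t) under the target policy.

    Assumption: weights[i] is the ratio pi(a_i | x_i) / beta(a_i | x_i) of
    target to behavior action probabilities for logged sample i.
    """
    n = len(returns)
    est = sum(w for r, w in zip(returns, weights) if r <= t) / n
    # Weighted sums can leave [0, 1]; clip so the value is a valid probability.
    return min(1.0, max(0.0, est))


def plug_in_quantile(returns, weights, alpha):
    """Plug-in risk estimate: smallest observed return with estimated CDF >= alpha."""
    for t in sorted(set(returns)):
        if is_cdf(returns, weights, t) >= alpha:
            return t
    return max(returns)


# Toy logged data (hypothetical): observed returns and importance weights.
returns = [0.0, 1.0, 1.0, 2.0]
weights = [0.5, 1.5, 1.0, 1.0]

cdf_at_one = is_cdf(returns, weights, 1.0)
median_estimate = plug_in_quantile(returns, weights, 0.5)
```

Any risk functional that depends only on the distribution of returns (a quantile here) can be evaluated on the estimated CDF in the same plug-in fashion, so one CDF estimate serves many risk measures at once.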
