Distributionally Robust Constrained Reinforcement Learning under Strong Duality | allainews.com

June 25, 2024, 4:48 a.m. | Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman, Yisong Yue

cs.LG updates on arXiv.org arxiv.org

arXiv:2406.15788v1 Announce Type: new
Abstract: We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for the separate problems of distributionally robust RL and constrained RL, there do not yet exist algorithms with end-to-end convergence guarantees for …

abstract arxiv budgets constraints cs.lg distribution environmental environments policies problem reinforcement reinforcement learning robust safety study testing training type

More from arxiv.org / cs.LG updates on arXiv.org

Bayesian identification of nonseparable Hamiltonians with multiplicative noise using deep learning and reduced-order modeling 1 day, 2 hours ago | arxiv.org

abstract arxiv bayesian cs.lg +17

MMGPL: Multimodal Medical Data Analysis with Graph Prompt Learning 1 day, 2 hours ago | arxiv.org

abstract analysis arxiv cs.cv +16

Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries 1 day, 2 hours ago | arxiv.org

arxiv cs.cv cs.lg detection +3

MixerFlow: MLP-Mixer meets Normalising Flows 1 day, 2 hours ago | arxiv.org

abstract architectures arxiv context +15

Machine Learning-Enabled Software and System Architecture Frameworks 1 day, 2 hours ago | arxiv.org

abstract architecture arxiv concerns +22

Efficient Interaction-Aware Interval Analysis of Neural Network Feedback Loops 1 day, 2 hours ago | arxiv.org

abstract analysis arxiv cs.lg +19

Kernelised Normalising Flows 1 day, 2 hours ago | arxiv.org

abstract architecture arxiv capabilities +14

GSplit: Scaling Graph Neural Network Training on Large Graphs via Split-Parallelism 1 day, 2 hours ago | arxiv.org

abstract arxiv class cs.dc +25

Reinforcement Learning in Credit Scoring and Underwriting 1 day, 2 hours ago | arxiv.org

abstract action adapt arxiv +17

Performance Marketing Manager

@ Jerry | New York City

View on ai-jobs.net

Senior Growth Marketing Manager (FULLY REMOTE)

@ Jerry | Seattle, WA

View on ai-jobs.net

Growth Marketing Channel Manager

@ Jerry | New York City

View on ai-jobs.net

Azure Integration Developer - Consultant - Bangalore

@ KPMG India | Bengaluru, Karnataka, India

View on ai-jobs.net

Director - Technical Program Manager

@ Capital One | Bengaluru, In

View on ai-jobs.net

Lead Developer-Process Automation -Python Developer

@ Diageo | Bengaluru Karle Town SEZ

View on ai-jobs.net