all AI news
Distributionally Robust Constrained Reinforcement Learning under Strong Duality
June 25, 2024, 4:48 a.m. | Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman, Yisong Yue
cs.LG updates on arXiv.org arxiv.org
Abstract: We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for the separate problems of distributionally robust RL and constrained RL, there do not yet exist algorithms with end-to-end convergence guarantees for …
abstract arxiv budgets constraints cs.lg distribution environmental environments policies problem reinforcement reinforcement learning robust safety study testing training type
More from arxiv.org / cs.LG updates on arXiv.org
MixerFlow: MLP-Mixer meets Normalising Flows
1 day, 2 hours ago |
arxiv.org
Kernelised Normalising Flows
1 day, 2 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Performance Marketing Manager
@ Jerry | New York City
Senior Growth Marketing Manager (FULLY REMOTE)
@ Jerry | Seattle, WA
Growth Marketing Channel Manager
@ Jerry | New York City
Azure Integration Developer - Consultant - Bangalore
@ KPMG India | Bengaluru, Karnataka, India
Director - Technical Program Manager
@ Capital One | Bengaluru, In
Lead Developer-Process Automation -Python Developer
@ Diageo | Bengaluru Karle Town SEZ