June 25, 2024, 4:48 a.m. | Zhengfei Zhang, Kishan Panaganti, Laixi Shi, Yanan Sui, Adam Wierman, Yisong Yue

cs.LG updates on arXiv.org arxiv.org

arXiv:2406.15788v1 Announce Type: new
Abstract: We study the problem of Distributionally Robust Constrained RL (DRC-RL), where the goal is to maximize the expected reward subject to environmental distribution shifts and constraints. This setting captures situations where training and testing environments differ, and policies must satisfy constraints motivated by safety or limited budgets. Despite significant progress toward algorithm design for the separate problems of distributionally robust RL and constrained RL, there do not yet exist algorithms with end-to-end convergence guarantees for …

abstract arxiv budgets constraints cs.lg distribution environmental environments policies problem reinforcement reinforcement learning robust safety study testing training type

Performance Marketing Manager

@ Jerry | New York City

Senior Growth Marketing Manager (FULLY REMOTE)

@ Jerry | Seattle, WA

Growth Marketing Channel Manager

@ Jerry | New York City

Azure Integration Developer - Consultant - Bangalore

@ KPMG India | Bengaluru, Karnataka, India

Director - Technical Program Manager

@ Capital One | Bengaluru, In

Lead Developer-Process Automation -Python Developer

@ Diageo | Bengaluru Karle Town SEZ