Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
March 29, 2024, 4:43 a.m. | James Queeney, Erhan Can Ozcan, Ioannis Ch. Paschalidis, Christos G. Cassandras
cs.LG updates on arXiv.org arxiv.org
Abstract: Robustness and safety are critical for the trustworthy deployment of deep reinforcement learning. Real-world decision making applications require algorithms that can guarantee robust performance and safety in the presence of general environment disturbances, while making limited assumptions on the data collection process during training. In order to accomplish this goal, we introduce a safe reinforcement learning framework that incorporates robustness through the use of an optimal transport cost uncertainty set. We provide an efficient implementation …