Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees. (arXiv:2201.08355v1 [cs.RO])
Jan. 21, 2022, 2:11 a.m. | Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac
cs.LG updates on arXiv.org
Safety is a critical component of autonomous systems and remains a challenge
for learning-based policies to be utilized in the real world. In particular,
policies learned using reinforcement learning often fail to generalize to novel
environments due to unsafe behavior. In this paper, we propose
Sim-to-Lab-to-Real to safely close the reality gap. To improve safety, we apply
a dual policy setup where a performance policy is trained using the cumulative
task reward and a backup (safety) policy is trained by …