Oct. 24, 2022, 1:12 a.m. | Juan Jose Garau-Luis, Yingjie Miao, John D. Co-Reyes, Aaron Parisi, Jie Tan, Esteban Real, Aleksandra Faust

cs.LG updates on arXiv.org arxiv.org

Deploying Reinforcement Learning (RL) agents in the real world requires
designing and tuning algorithms for problem-specific objectives such as
performance, robustness, or stability. These objectives can frequently change,
which will then necessitate further painstaking design and tuning. This paper
presents MetaPG, an evolutionary method for designing new loss functions for
actor-critic RL algorithms that optimize for different objectives. In
particular, we focus on the objectives of final performance in training regime,
policy robustness to unseen environment configurations, and training curve …

actor-critic algorithms arxiv

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Cint | Gurgaon, India

Data Science (M/F), setor automóvel - Aveiro

@ Segula Technologies | Aveiro, Portugal