all AI news
Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
April 3, 2024, 4:42 a.m. | Golnaz Mesbahi, Olya Mastikhina, Parham Mohammad Panahi, Martha White, Adam White
cs.LG updates on arXiv.org arxiv.org
Abstract: In continual or lifelong reinforcement learning access to the environment should be limited. If we aspire to design algorithms that can run for long-periods of time, continually adapting to new, unexpected situations then we must be willing to deploy our agents without tuning their hyperparameters over the agent's entire lifetime. The standard practice in deep RL -- and even continual RL -- is to assume unfettered access to deployment environment for the full lifetime of …
abstract agents algorithms arxiv aspire continual cs.lg deploy design environment evaluation reinforcement reinforcement learning strategies the environment the unknown type
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US