all AI news
Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
April 3, 2024, 4:42 a.m. | Golnaz Mesbahi, Olya Mastikhina, Parham Mohammad Panahi, Martha White, Adam White
cs.LG updates on arXiv.org arxiv.org
Abstract: In continual or lifelong reinforcement learning access to the environment should be limited. If we aspire to design algorithms that can run for long-periods of time, continually adapting to new, unexpected situations then we must be willing to deploy our agents without tuning their hyperparameters over the agent's entire lifetime. The standard practice in deep RL -- and even continual RL -- is to assume unfettered access to deployment environment for the full lifetime of …
abstract agents algorithms arxiv aspire continual cs.lg deploy design environment evaluation reinforcement reinforcement learning strategies the environment the unknown type
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US