Feb. 20, 2024, 5:45 a.m. | Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren

cs.LG updates on arXiv.org arxiv.org

arXiv:2307.05209v3 Announce Type: replace-cross
Abstract: Recent studies show that deep reinforcement learning (DRL) agents tend to overfit to the task on which they were trained and fail to adapt to minor environment changes. To expedite learning when transferring to unseen tasks, we propose a novel approach to representing the current task using reward machines (RMs), state machine abstractions that induce subtasks based on the current task's rewards and dynamics. Our method provides agents with symbolic representations of optimal transitions from …

