Feb. 20, 2024, 5:45 a.m. | Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren

cs.LG updates on arXiv.org arxiv.org

arXiv:2307.05209v3 Announce Type: replace-cross
Abstract: Recent studies show that deep reinforcement learning (DRL) agents tend to overfit to the task on which they were trained and fail to adapt to minor environment changes. To expedite learning when transferring to unseen tasks, we propose a novel approach to representing the current task using reward machines (RMs), state machine abstractions that induce subtasks based on the current task's rewards and dynamics. Our method provides agents with symbolic representations of optimal transitions from …

abstract abstractions adapt agents arxiv cs.ai cs.lg environment machine novel planning reinforcement reinforcement learning show studies tasks transfer type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US