Jan. 5, 2024, 10:04 p.m. | /u/APaperADay

Machine Learning www.reddit.com

**OpenReview**: [https://openreview.net/forum?id=5j6wtOO6Fk](https://openreview.net/forum?id=5j6wtOO6Fk)

**arXiv**: [https://arxiv.org/abs/2310.05167](https://arxiv.org/abs/2310.05167)

**Code**: [https://github.com/Snagnar/Hieros](https://github.com/Snagnar/Hieros)

**Abstract**:

>One of the biggest challenges to modern deep reinforcement learning (DRL) algorithms is sample efficiency. Many approaches learn a world model in order to train an agent entirely in imagination, eliminating the need for direct environment interaction during training. However, these methods often suffer from either a lack of imagination accuracy, exploration capabilities, or runtime efficiency. We propose **Hieros**, a hierarchical policy that learns time abstracted world representations and imagines trajectories at multiple …

abstract accuracy agent algorithms capabilities challenges efficiency environment exploration imagination learn machinelearning modern reinforcement reinforcement learning sample train training world

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US