[R] Hieros: Hierarchical Imagination on Structured State Space Sequence World Models

Jan. 5, 2024, 10:04 p.m. | /u/APaperADay

Machine Learning www.reddit.com

**OpenReview**: [https://openreview.net/forum?id=5j6wtOO6Fk](https://openreview.net/forum?id=5j6wtOO6Fk)

**arXiv**: [https://arxiv.org/abs/2310.05167](https://arxiv.org/abs/2310.05167)

**Code**: [https://github.com/Snagnar/Hieros](https://github.com/Snagnar/Hieros)

**Abstract**:

>One of the biggest challenges to modern deep reinforcement learning (DRL) algorithms is sample efficiency. Many approaches learn a world model in order to train an agent entirely in imagination, eliminating the need for direct environment interaction during training. However, these methods often suffer from either a lack of imagination accuracy, exploration capabilities, or runtime efficiency. We propose **Hieros**, a hierarchical policy that learns time abstracted world representations and imagines trajectories at multiple …
