June 4, 2023, 1:30 p.m. | Edan Meyer

Edan Meyer www.youtube.com

Reinforcement learning is great, but environment interaction can be expensive. This paper proposes an RL algorithm based of successor features that takes advantage of passive data to learn about the world without acting itself.

Outline
0:00 - Intro
1:41 - Offline-RL
2:50 - Successor Features
13:34 - Algorithm
21:11 - Results
27:50 - Criticisms & Thoughts

Social Media
YouTube - https://youtube.com/c/EdanMeyer
Twitter - https://twitter.com/ejmejm1

Sources:
Paper - https://arxiv.org/pdf/2304.04782.pdf

algorithm data environment features intro learn offline paper reinforcement reinforcement learning videos world youtube

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Reporting & Data Analytics Lead (Sizewell C)

@ EDF | London, GB

Data Analyst

@ Notable | San Mateo, CA