June 26, 2022, 1 p.m. | Edan Meyer

Edan Meyer www.youtube.com

OpenAI's new paper, "Video PreTraining(VPT): Learning to Act by Watching Unlabeled Online Videos" trains an agent to play Minecraft using a mixture of imitation learning (in the form of behavior cloning) and reinforcement learning (RL). The results are very impressive, training a 500 million parameter model that can obtain diamonds and occasionally craft diamond tools. I cover the paper in this video, talking about their approach of using and fine-tuning a foundational model with semi-supervised learning.

Outline
0:00 - Intro …

youtube

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Stagista Technical Data Engineer

@ Hager Group | BRESCIA, IT

Data Analytics - SAS, SQL - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India