June 11, 2024, 11 a.m. | Super Data Science: ML & AI Podcast with Jon Krohn

Reinforcement learning from human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to @JonKrohnLearns about the technique’s origins. He also walks through other ways to fine-tune LLMs and explains how he believes generative AI might democratize education.

