June 13, 2024, 6 p.m. | Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science Podcast with Jon Krohn www.youtube.com

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to @JonKrohnLearns about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education.

Watch the full interview “791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert” here: https://www.superdatascience.com/791

education feedback fine-tune generative human human feedback humanoid llms reinforcement reinforcement learning research research scientist rlhf robots talks through

Senior Data Engineer

@ Displate | Warsaw

Engineer III, Back-End Server (mult.)

@ Samsung Electronics | 645 Clyde Avenue, Mountain View, CA, USA

Senior Product Security Engineer - Cyber Security Researcher

@ Boeing | USA - Arlington, VA

Senior Manager, Software Engineering, DevOps

@ Capital One | Richmond, VA

PGIM Quantitative Solutions, Investment Multi-Asset Research (Hybrid)

@ Prudential Financial | Prudential Tower, 655 Broad Street, Newark, NJ

Cyber Security Engineer

@ HP | FTC02 - Fort Collins, CO East Link (FTC02)