June 14, 2024, 6 p.m. | Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science Podcast with Jon Krohn www.youtube.com

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to @JonKrohnLearns about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education.

Watch the full interview “791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert” here: https://www.superdatascience.com/791

education feedback fine-tune generative global human human feedback llms multimodal reinforcement reinforcement learning research research scientist rlhf talks through will

Senior Data Engineer

@ Displate | Warsaw

Professor/Associate Professor of Health Informatics [LKCMedicine]

@ Nanyang Technological University | NTU Novena Campus, Singapore

Research Fellow (Computer Science (and Engineering)/Electronic Engineering/Applied Mathematics/Perception Sciences)

@ Nanyang Technological University | NTU Main Campus, Singapore

Java Developer - Assistant Manager

@ State Street | Bengaluru, India

Senior Java/Python Developer

@ General Motors | Austin IT Innovation Center North - Austin IT Innovation Center North

Research Associate (Computer Engineering/Computer Science/Electronics Engineering)

@ Nanyang Technological University | NTU Main Campus, Singapore