791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert

June 11, 2024, 11 a.m. | Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science Podcast with Jon Krohn www.youtube.com

#ReinforcementLearning #RLHF #GenerativeAI

Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to @JonKrohnLearns about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education.

This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0), and Crawlbase (https://crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience …

feedback fine-tune generative generativeai human human feedback llms reinforcement reinforcement learning reinforcementlearning research research scientist rlhf talks through

Visit resource

More from www.youtube.com / Super Data Science Podcast with Jon Krohn

New to Bayesian Statistics? Start Here! 11 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +16

PyTensor's Under-the-Hood "Secrets" 15 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +16

Bayesians Never Trust Priors 100% 1 day, 8 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +16

794: Exciting (and Frightening!) Trends in Open-Source AI — with Jon Krohn (@JonKrohnLearns) 1 day, 15 hours ago | www.youtube.com

conference conway data data science +12

Low-Quality Data? Going Bayesian Could Help! 2 days, 8 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +18

How Computing Power Brought Back Bayesian Statistics 2 days, 15 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +19

Bayesian Statistics in a Nutshell 3 days, 8 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +16

A Cool, Practical Explanation of Bayesian Stats 3 days, 15 hours ago | www.youtube.com

alex bayesian bayesian modeling boosting +17

793: Bayesian Methods and Applications — with Alexandre Andorra 4 days, 15 hours ago | www.youtube.com

alex applications bayesian bayesian modeling +14

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Professor/Associate Professor of Health Informatics [LKCMedicine]

@ Nanyang Technological University | NTU Novena Campus, Singapore

View on ai-jobs.net

Research Fellow (Computer Science (and Engineering)/Electronic Engineering/Applied Mathematics/Perception Sciences)

@ Nanyang Technological University | NTU Main Campus, Singapore

View on ai-jobs.net

Java Developer - Assistant Manager

@ State Street | Bengaluru, India

View on ai-jobs.net

Senior Java/Python Developer

@ General Motors | Austin IT Innovation Center North - Austin IT Innovation Center North

View on ai-jobs.net

Research Associate (Computer Engineering/Computer Science/Electronics Engineering)

@ Nanyang Technological University | NTU Main Campus, Singapore

View on ai-jobs.net

all AI news

791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert

More from www.youtube.com / Super Data Science Podcast with Jon Krohn

Jobs in AI, ML, Big Data

Senior Data Engineer

Professor/Associate Professor of Health Informatics [LKCMedicine]

Research Fellow (Computer Science (and Engineering)/Electronic Engineering/Applied Mathematics/Perception Sciences)

Java Developer - Assistant Manager

Senior Java/Python Developer

Research Associate (Computer Engineering/Computer Science/Electronics Engineering)