Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680 | allainews.com

April 16, 2024, 10:58 p.m. | Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) twimlai.com

Today we're joined by Alex Havrilla, a PhD student at Georgia Tech, to discuss "Teaching Large Language Models to Reason with Reinforcement Learning." Alex discusses the role of creativity and exploration in problem solving and explores the opportunities presented by applying reinforcement learning algorithms to the challenge of improving reasoning in large language models. Alex also shares his research on the effect of noise on language model training, highlighting the robustness of LLM architecture. Finally, we delve into the future …

More from twimlai.com / The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Controlling Fusion Reactor Instability with Deep Reinforcement Learning with Aza Jalalvand - #682 2 days, 10 hours ago | twimlai.com

GraphRAG: Knowledge Graphs for AI Applications with Kirk Marple - #681 1 week, 2 days ago | twimlai.com

Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla - #680 2 weeks, 1 day ago | twimlai.com

Localizing and Editing Knowledge in LLMs with Peter Hase - #679 3 weeks, 2 days ago | twimlai.com

Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678 1 month ago | twimlai.com
