From Research to Production: Fine-Tuning & Aligning LLMs // Philipp Schmid // AI in Production | allainews.com

April 4, 2024, 4:45 p.m. | MLOps.community

MLOps.community www.youtube.com

// Abstract
Discover the essential steps in transitioning LLMs from research to production, with a focus on effective fine-tuning and alignment strategies. This session delves into how to fine-tune & evaluate LLMs with Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF)/Direct Preference Optimization (DPO), and their practical applications for aligning LLMs with production goals.

// Bio
Philipp Schmid is a Technical Lead at Hugging Face with the mission to democratize good machine learning through open source and open science. …

abstract alignment direct preference optimization feedback fine-tuning focus human human feedback llms optimization production reinforcement reinforcement learning research rlhf session sft strategies supervised fine-tuning

More from www.youtube.com / MLOps.community

AI Innovations: The Power of Feature Platforms // MLOps Mini Summit #6 14 hours ago | www.youtube.com

abstract ai innovations build building +19

FEDML Nexus AI: Your Generative AI Platform at Scale // Salman Avestimehr // MLOps podcast … 1 day, 14 hours ago | www.youtube.com

abstract ai applications ai platform applications +15

What is AI Quality? // Mohamed Elgendy // MLOps Podcast #229 5 days, 13 hours ago | www.youtube.com

abstract ceo co-founder concept +11

AI's Struggle with Abstraction in Analogies // Shane Morris // MLOps podcast #223 clip 6 days, 14 hours ago | www.youtube.com

abstract automation autonomous autonomous systems +19

The Mind Behind the AI Coding Assistant // Peter Guagenti // MLOps podcast #222 clip 1 week ago | www.youtube.com

ai coding ai coding assistant assistant business +20

Streamlining Model Deployment // Daniel Lenton // AI in Production Talk 1 week ago | www.youtube.com

abstract aiaas ai companies ai infrastructure +21

LLMOps and GenAI at Enterprise Scale - Challenges and Opportunities // Andy McMahon // AI … 1 week ago | www.youtube.com

abstract andy challenges development +17

Data Labeling Best Practices // Charles Brecque // AI in Production Conference Lightning Talk 1 week ago | www.youtube.com

abstract best practices bio conference +17

Explaining ChatGPT to Anyone in 10 Minutes // Cameron Wolfe // AI in Production Conference 1 week ago | www.youtube.com

abstract become chatgpt conference +13

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net