Fireside Chat - The Future of LLMs // David Hershey & Daniel Jeffries // LLMs in Prod Con Part 2

Aug. 17, 2023, 1:09 p.m. | MLOps.community

MLOps.community www.youtube.com

// Abstract
Evaluating the performance of large language models (LLMs) is a pressing issue for companies working with generative AI. Defining what makes a model "good" and measuring its performance are both challenging because LLM applications are so diverse. Existing evaluation methods, including benchmarks and user preference comparisons, are limited in scalability and objectivity. The future of LLM evaluation lies in scaling testing with machine learning systems, such as reward models that capture user preferences, and simulating user sessions to …


