Fireside Chat - The Future of LLMs // David Hershey & Daniel Jeffries // LLMs in Prod Con Part 2
Aug. 17, 2023, 1:09 p.m. | MLOps.community
Evaluating the performance of large language models (LLMs) is a pressing issue for companies working with generative AI. Defining what makes a model "good" and measuring its performance are challenging because LLM applications are so diverse. Existing evaluation methods, including benchmarks and user-preference comparisons, have limitations in scalability and objectivity. The future of LLM evaluation lies in scaling testing with machine learning systems, such as reward models that capture user preferences, and simulating user sessions to …
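The reward-model idea mentioned above can be sketched in a few lines. This is a hypothetical illustration, not the speakers' implementation: the scoring function below is a toy stand-in for a learned reward model, which in practice would be a neural network trained on human preference data.

```python
# Sketch of reward-model-based evaluation: score candidate responses and
# rank them, approximating scaled-up preference testing.
# toy_reward_model is a stand-in heuristic, NOT a real learned reward model.

def toy_reward_model(prompt: str, response: str) -> float:
    """Toy scorer: rewards responses that share vocabulary with the
    prompt and lightly rewards longer (more substantive) answers."""
    if not response.strip():
        return 0.0
    overlap = len(set(prompt.lower().split()) & set(response.lower().split()))
    length_bonus = min(len(response.split()) / 50.0, 1.0)
    return overlap + length_bonus

def rank_candidates(prompt: str, candidates: list[str]) -> list[str]:
    """Score each candidate and return them best-first, the way a reward
    model can automate pairwise preference comparisons at scale."""
    scored = [(toy_reward_model(prompt, c), c) for c in candidates]
    return [c for _, c in sorted(scored, key=lambda t: t[0], reverse=True)]

prompt = "Explain what a reward model is."
candidates = [
    "",
    "A reward model scores model outputs to approximate human preferences.",
    "I don't know.",
]
best = rank_candidates(prompt, candidates)[0]
print(best)
```

In a real pipeline the heuristic would be replaced by a model fine-tuned on preference pairs, and simulated user sessions would feed it prompts drawn from production traffic.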