Advances in AI Model Evaluation // Salman Avestimehr // MLOps podcast #230 clip 2 | allainews.com

May 16, 2024, 5:05 p.m. | MLOps.community

MLOps.community www.youtube.com

MLOps podcast #230 with Salman Avestimehr, CEO & Founder of FedML // FedML Nexus AI: Your Generative AI Platform at Scale.

A big thank you to @fedmlai for sponsoring this episode!

Salman discusses the challenges and current practices in assessing the efficacy and reliability of general-purpose LLMs, as well as those developed for specific verticals like healthcare and coding. He points out the lack of benchmarks in certain specialized areas, suggesting a potential increase in benchmark development to aid in …

advances ai model ai platform big ceo challenges clip current evaluation fedml founder general generative generative ai platform mlops mlops podcast platform podcast practices reliability scale

More from www.youtube.com / MLOps.community

Evaluating Spotify's Multimillion Item Database // Sanket Gupta // MLOps podcast #232 clip 4 hours ago | www.youtube.com

backend challenge engineer gupta +15

From Robotics to Recommender Systems // Miguel Fierro // MLOps Podcast #240 2 days, 2 hours ago | www.youtube.com

abstract computer computer vision control +17

Uber's Michelangelo: Strategic AI Overhaul and Impact // Demetrios Brinkmann // MLOps podcast #239 6 days, 1 hour ago | www.youtube.com

abstract basic capabilities challenges +14

Envisioning a New Era of AI Accessibility // Ryan Carson // MLOps Podcast #231 clip 1 week ago | www.youtube.com

ai developers building community dev +15

AWS Tranium and Inferentia // Kamran Khan and Matthew McClean // MLOps Podcast #238 1 week, 2 days ago | www.youtube.com

abstract accelerators ai accelerators architecture +24

Build Reliable Systems with Chaos Engineering // Benjamin Wilms // MLOps Podcast #237 1 week, 6 days ago | www.youtube.com

abstract benjamin bio build +18

Handling Massive Machine Learning Models // Simon Karasik // MLOps podcast #228 clip 2 weeks ago | www.youtube.com

abstract big cloud cloud storage +15

Avoiding AI POC Purgatory // Sol Rashidi // MLOps podcast #227 clip 2 weeks, 1 day ago | www.youtube.com

abstract building cases ceo +20

Managing Small Knowledge Graphs for Multi-agent Systems // Tom Smoker // MLOps Podcast #236 2 weeks, 2 days ago | www.youtube.com

abstract accuracy agent agents +20

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

Software Engineer III -Full Stack Developer - ModelOps, MLOps

@ JPMorgan Chase & Co. | NY, United States

View on ai-jobs.net

Senior Lead Software Engineer - Full Stack Senior Developer - ModelOps, MLOps

@ JPMorgan Chase & Co. | NY, United States

View on ai-jobs.net

Software Engineer III - Full Stack Developer - ModelOps, MLOps

@ JPMorgan Chase & Co. | NY, United States

View on ai-jobs.net

Research Scientist (m/w/d) - Numerische Simulation Laser-Materie-Wechselwirkung

@ Fraunhofer-Gesellschaft | Freiburg, DE, 79104

View on ai-jobs.net

Research Scientist, Speech Real-Time Dialog

@ Google | Mountain View, CA, USA

View on ai-jobs.net