all AI news
Advances in AI Model Evaluation // Salman Avestimehr // MLOps podcast #230 clip 2
May 16, 2024, 5:05 p.m. | MLOps.community
MLOps.community www.youtube.com
A big thank you to @fedmlai for sponsoring this episode!
Salman discusses the challenges and current practices in assessing the efficacy and reliability of general-purpose LLMs, as well as those developed for specific verticals like healthcare and coding. He points out the lack of benchmarks in certain specialized areas, suggesting a potential increase in benchmark development to aid in …
advances ai model ai platform big ceo challenges clip current evaluation fedml founder general generative generative ai platform mlops mlops podcast platform podcast practices reliability scale
More from www.youtube.com / MLOps.community
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer
@ GPTZero | Toronto, Canada
Software Engineer III -Full Stack Developer - ModelOps, MLOps
@ JPMorgan Chase & Co. | NY, United States
Senior Lead Software Engineer - Full Stack Senior Developer - ModelOps, MLOps
@ JPMorgan Chase & Co. | NY, United States
Software Engineer III - Full Stack Developer - ModelOps, MLOps
@ JPMorgan Chase & Co. | NY, United States
Research Scientist (m/w/d) - Numerische Simulation Laser-Materie-Wechselwirkung
@ Fraunhofer-Gesellschaft | Freiburg, DE, 79104
Research Scientist, Speech Real-Time Dialog
@ Google | Mountain View, CA, USA