706: Large Language Model Leaderboards and Benchmarks — with Caterina Constantinescu | allainews.com

Aug. 18, 2023, 11 a.m. | Super Data Science: ML & AI Podcast with Jon Krohn

Super Data Science Podcast with Jon Krohn www.youtube.com

#LargeLanguageModels #LLMLeaderboard #LLMEvaluation

In this episode, @JonKrohnLearns is joined by Caterina Constantinescu who dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.

Additional materials: https://www.superdatascience.com/706

Interested in sponsoring a SuperDataScience Podcast episode? Visit https://jonkrohn.com/podcast for sponsorship information.

arena benchmarks challenges chatbot dataset evaluation helm language language model language models large language large language model large language models largelanguagemodels llms platforms world

More from www.youtube.com / Super Data Science Podcast with Jon Krohn

How to Ensure Sucess for Your Solar Project 22 hours ago | www.youtube.com

advanced advances ceo climate +13

AI Can Prevent Climate Change Crises 1 day, 2 hours ago | www.youtube.com

advanced advances ceo change +13

Back-End Engineer vs Data Scientist: Who's Needed More? 1 day, 22 hours ago | www.youtube.com

advanced advances ceo climate +14

70% of Solar Projects Fail. Now AI Can Fix This. 2 days, 2 hours ago | www.youtube.com

advanced advances ceo climate +14

2 Pro Hacks to Avoid AI Hallucinations 2 days, 19 hours ago | www.youtube.com

advanced advances ai hallucinations ceo +14

784: Aligning Large Language Models — with Sinan Ozdemir 3 days, 2 hours ago | www.youtube.com

began conversation definitions generativeai +13

AI Finds the Perfect Places for Green Energy Projects 3 days, 19 hours ago | www.youtube.com

advanced advances ceo climate +15

The Evolution and Impact of AI on Journalism 4 days, 2 hours ago | www.youtube.com

advanced advances ceo climate +14

Can a Product Leader Make a Good CEO? 4 days, 19 hours ago | www.youtube.com

advanced advances ceo climate +14

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net