May 8, 2024, 6:20 p.m. | Allen Institute for AI

Video: Allen Institute for AI (www.youtube.com)

Abstract: Recent breakthroughs in machine learning rely heavily on pre-training: harnessing ever-larger datasets, models, and computational budgets to create base models for subsequent fine-tuning. In this talk, we develop a toolkit for pre-training. Drawing on empirical findings, we present methodologies for dataset construction and for de-risking large-scale model training. Our discussion covers both the multimodal and language modeling domains. By addressing the entire pre-training pipeline, from dataset creation to downstream evaluation, we aim to build better, more reliable models.
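The abstract does not spell out how large runs are de-risked. One common approach, sketched below under stated assumptions, is to fit a power-law scaling curve to a handful of small pilot runs and extrapolate the expected loss at the target compute budget before committing to the full run. The functional form, pilot-run numbers, and budgets here are illustrative assumptions, not material from the talk.

```python
# Minimal hedged sketch: de-risking a large pre-training run by fitting
# a saturating power law to small pilot runs and extrapolating loss at
# the planned budget. All data points below are made up for illustration.
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(compute, a, b, c):
    # loss = a * compute^(-b) + c, where c is the irreducible loss floor.
    return a * np.power(compute, -b) + c

# Pilot runs: (training compute in PF-days, final validation loss).
compute = np.array([0.5, 1.0, 2.0, 4.0, 8.0])
loss = np.array([3.98, 3.50, 3.14, 2.86, 2.65])

# Fit the three free parameters from the pilot measurements.
params, _ = curve_fit(scaling_law, compute, loss, p0=(1.0, 0.3, 2.0))
a, b, c = params

target = 512.0  # hypothetical large-run budget in PF-days
print(f"fit: a={a:.3f}, b={b:.3f}, c={c:.3f}")
print(f"predicted loss at {target} PF-days: {scaling_law(target, *params):.3f}")
```

If the extrapolated loss does not justify the target budget, the plan can be revised (more data, a different mixture, a different model shape) before any large-scale compute is spent; this is one reading of "de-risking", not necessarily the speaker's.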

Bio: Samir Yitzhak …

