April 9, 2022, 1 p.m. | Edan Meyer

Edan Meyer | www.youtube.com

Chinchilla is a massive language model released by DeepMind as part of a recent paper on scaling large language models in a compute-optimal manner. With only 70 billion parameters, it outperforms recent models such as GPT-3, Gopher, and Megatron-Turing NLG, which use hundreds of billions of parameters. DeepMind achieves this by training over 400 language models to find the optimal balance between model size and the amount of training data for a given compute budget.
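To make the idea concrete, here is a minimal sketch of the compute-optimal allocation the paper arrives at: parameters and training tokens should scale roughly equally with compute, which works out to the commonly cited rule of thumb of about 20 training tokens per parameter, with training FLOPs approximated by C ≈ 6·N·D. The function name and the exact constants below are illustrative assumptions, not code from the paper; they are chosen so that a Gopher-scale budget lands near Chinchilla's reported 70B parameters and 1.4T tokens.

```python
def compute_optimal_allocation(flops_budget: float) -> tuple[float, float]:
    """Sketch of the Chinchilla-style split of a FLOPs budget into (params, tokens)."""
    # Assumed heuristic from the paper's findings: ~20 training tokens per parameter.
    tokens_per_param = 20.0
    # Using the standard approximation C = 6 * N * D with D = 20 * N,
    # solve for N: N = sqrt(C / (6 * 20)).
    n_params = (flops_budget / (6.0 * tokens_per_param)) ** 0.5
    n_tokens = tokens_per_param * n_params
    return n_params, n_tokens


if __name__ == "__main__":
    # Roughly the Gopher/Chinchilla training budget (~5.76e23 FLOPs, an assumed figure).
    params, tokens = compute_optimal_allocation(5.76e23)
    print(f"params ≈ {params / 1e9:.0f}B, tokens ≈ {tokens / 1e12:.2f}T")
```

Running this sketch with the Gopher-scale budget yields roughly 69B parameters and 1.4T tokens, which is why Chinchilla, though far smaller than Gopher, is trained on several times more data.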

Outline:
0:00 - Overview
1:51 …

