Andrej Karpathy's Llama 3 review

April 18, 2024, 8:50 p.m. |

Simon Willison's Weblog simonwillison.net

The most interesting coverage I've seen so far of Meta's Llama 3 models (8b and 70b so far, 400b promised later).

Andrej notes that Llama 3 trained on 15 trillion tokens - up from 2 trillion for Llama 2 - and they used that many even for the smaller 8b model, 75x more than the chinchilla scaling laws would suggest.

The tokenizer has also changed - they now use 128,000 tokens, up from 32,000. This …

70b ai andrej karpathy andrejkarpathy coverage generativeai llama llama 2 llama 3 llms meta notes review tokens

Visit resource

More from simonwillison.net / Simon Willison's Weblog

We can have a different web 10 hours ago | simonwillison.net

audio dog headphones mollywhite +2

Quoting Tom Eastman 10 hours ago | simonwillison.net

five internet remember when text +2

Llama 3 prompt formats 18 hours ago | simonwillison.net

ai clear documentation every +12

Introducing the Claude Team plan and iOS app 20 hours ago | simonwillison.net

access anthropic app claude +11

Save the Web by Being Nice 1 day, 10 hours ago | simonwillison.net

andrew article blog blogging +6

Quoting LMSYS 1 day, 16 hours ago | simonwillison.net

ai api commercial community +9

Quoting D. Richard Hipp 1 day, 22 hours ago | simonwillison.net

analysis code cpu decoding +11

How an empty S3 bucket can make your AWS bill explode 2 days, 1 hour ago | simonwillison.net

aws bill empty s3 +4

My approach to HTML web components 2 days, 1 hour ago | simonwillison.net

components frameworks html isn +11

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Business Data Scientist, gTech Ads

@ Google | Mexico City, CDMX, Mexico

View on ai-jobs.net

Lead, Data Analytics Operations

@ Zocdoc | Pune, Maharashtra, India

View on ai-jobs.net

View more jobs

all AI news

Andrej Karpathy's Llama 3 review

More from simonwillison.net / Simon Willison's Weblog

Jobs in AI, ML, Big Data

Data Architect

Data ETL Engineer

Lead GNSS Data Scientist

Senior Machine Learning Engineer (MLOps)

Business Data Scientist, gTech Ads

Lead, Data Analytics Operations