Dec. 3, 2023, 5 p.m. | Yannic Kilcher

Yannic Kilcher www.youtube.com

#chatgpt #privacy #promptengineering

Researchers were able to extract large amounts of training data from ChatGPT simply by asking it to repeat a single word many times, which causes the model to diverge and start spitting out memorized text.
Why does this happen? And how much of their training data do such models really memorize verbatim?
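To call an output "memorized," the researchers check whether a long span of it appears verbatim in a training corpus. A minimal sketch of that kind of check (the helper name and the naive whitespace tokenization are my own; the paper works with model tokens and a suffix-array index over much larger data):

```python
def has_verbatim_overlap(output: str, corpus: str, min_tokens: int = 50) -> bool:
    """Return True if `output` contains a run of `min_tokens` consecutive
    tokens that appears verbatim in `corpus`.

    Naive illustration: whitespace tokenization and substring search.
    A real pipeline would use the model's tokenizer and an efficient
    index (e.g. a suffix array) over the training data.
    """
    out_toks = output.split()
    for i in range(len(out_toks) - min_tokens + 1):
        window = " ".join(out_toks[i:i + min_tokens])
        if window in corpus:
            return True
    return False
```

With a small threshold for demonstration, `has_verbatim_overlap("a b c d e f", "x a b c d e y", min_tokens=5)` returns True, since the five-token run "a b c d e" occurs verbatim in the corpus.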

OUTLINE:
0:00 - Intro
8:05 - Extractable vs Discoverable Memorization
14:00 - Models leak more data than previously thought
20:25 - Some data is …

