Scalable Extraction of Training Data from (Production) Language Models (Paper Explained) | allainews.com

Dec. 3, 2023, 5 p.m. | Yannic Kilcher

Yannic Kilcher www.youtube.com

#chatgpt #privacy #promptengineering

Researchers were able to extract large amounts of training data from ChatGPT simply by asking it to repeat a single word many times over, which causes the model to diverge and start emitting memorized text.
Why does this happen? And how much of their training data do such models really memorize verbatim?
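To make the idea of "diverging into memorized text" concrete, here is a minimal sketch of how one might check a model response for verbatim overlap with a reference corpus. It is an illustration under stated assumptions, not the procedure from the paper: the function names (`divergence_index`, `build_ngram_index`, `memorized_spans`), the whitespace tokenization, and the tiny in-memory corpus are all hypothetical, and a real study would match against a far larger corpus with a more scalable index.

```python
def divergence_index(tokens, word):
    """Return the position of the first token that is not the repeated word.

    Returns len(tokens) if the output never diverges.
    """
    target = word.lower()
    for i, tok in enumerate(tokens):
        if tok.strip(".,!?\"'").lower() != target:
            return i
    return len(tokens)


def build_ngram_index(corpus_docs, n):
    """Collect every whitespace-token n-gram in the reference corpus (assumed small)."""
    index = set()
    for doc in corpus_docs:
        toks = doc.split()
        for i in range(len(toks) - n + 1):
            index.add(tuple(toks[i:i + n]))
    return index


def memorized_spans(output_text, word, ngram_index, n):
    """Return the n-grams after the divergence point that appear verbatim in the corpus."""
    tokens = output_text.split()
    tail = tokens[divergence_index(tokens, word):]
    hits = []
    for i in range(len(tail) - n + 1):
        gram = tuple(tail[i:i + n])
        if gram in ngram_index:
            hits.append(" ".join(gram))
    return hits


# Toy data, purely illustrative: a one-document "corpus" and a fake model output.
corpus = ["the quick brown fox jumps over the lazy dog"]
output = "poem poem poem poem the quick brown fox jumps high"
index = build_ngram_index(corpus, 4)
spans = memorized_spans(output, "poem", index, 4)
```

The window length `n` controls how strict the check is: short windows flag common phrases, while long windows only flag text that is very unlikely to match by chance.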

OUTLINE:
0:00 - Intro
8:05 - Extractable vs Discoverable Memorization
14:00 - Models leak more data than previously thought
20:25 - Some data is …


