March 7, 2024, 11:19 a.m. | /u/SunsetOneSix

Machine Learning | www.reddit.com

**Paper**: [https://arxiv.org/abs/2402.07043](https://arxiv.org/abs/2402.07043)

**Abstract**:

>As AI model size grows, neural *scaling laws* have become a crucial tool to predict the improvements of large models when increasing capacity and the size of original (human or natural) training data. Yet, the widespread use of popular models means that the ecosystem of online data and text will co-evolve to progressively contain increased amounts of synthesized data. In this paper we ask: *How will the scaling laws change in the inevitable regime where synthetic data …
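
For context on the term used in the abstract, a neural scaling law is usually expressed as a power law relating test loss to model capacity and (clean) training-data size. The Chinchilla-style form below is a common illustrative example and is an assumption here, not the specific law derived or modified in the paper:

```latex
% Illustrative (Chinchilla-style) scaling law, not the paper's specific result.
% L = test loss, N = number of parameters, D = number of training tokens,
% E = irreducible loss, A, B, \alpha, \beta = fitted constants.
L(N, D) = E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}}
```

The question the abstract poses is how fits of this kind behave once a growing fraction of the training tokens is itself model-generated rather than original human or natural data.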
