Aug. 5, 2022, 11:42 a.m. | /u/user_--

Machine Learning www.reddit.com

These papers indicate that when a large model is trained on a small dataset for a very long time, the test loss first goes down, then climbs back up as the model overfits, but eventually comes back down even lower than before, and the model ends up generalizing correctly (sometimes called "grokking").
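For concreteness, here is a minimal sketch of the kind of experiment those papers describe, loosely following the modular-arithmetic setup: train a small network on a fraction of all (a + b) mod p pairs for far longer than it takes to fit the training set, and watch whether test accuracy eventually recovers. The architecture and hyperparameters below are illustrative guesses, not the papers' exact settings:

    # Minimal grokking-style experiment (illustrative, not the papers' exact setup).
    import torch
    import torch.nn as nn

    P = 97  # modulus; the task is predicting (a + b) mod P
    # Enumerate every (a, b) pair and keep only a small fraction for training.
    pairs = torch.cartesian_prod(torch.arange(P), torch.arange(P))
    labels = (pairs[:, 0] + pairs[:, 1]) % P
    perm = torch.randperm(len(pairs))
    n_train = int(0.3 * len(pairs))  # small dataset: 30% of all pairs
    train_idx, test_idx = perm[:n_train], perm[n_train:]

    class TinyNet(nn.Module):
        def __init__(self, p, d=128):
            super().__init__()
            self.embed = nn.Embedding(p, d)
            self.mlp = nn.Sequential(
                nn.Linear(2 * d, 256), nn.ReLU(), nn.Linear(256, p)
            )

        def forward(self, x):
            e = self.embed(x)              # (batch, 2, d)
            return self.mlp(e.flatten(1))  # (batch, p) logits

    model = TinyNet(P)
    # Heavy weight decay; regularization is reported to matter for grokking.
    opt = torch.optim.AdamW(model.parameters(), lr=1e-3, weight_decay=1.0)
    loss_fn = nn.CrossEntropyLoss()

    def accuracy(idx):
        with torch.no_grad():
            return (model(pairs[idx]).argmax(-1) == labels[idx]).float().mean().item()

    # Train far past the point where the training set is fully fit, and log
    # whether test accuracy eventually jumps long after train accuracy saturates.
    for step in range(50_000):
        opt.zero_grad()
        loss = loss_fn(model(pairs[train_idx]), labels[train_idx])
        loss.backward()
        opt.step()
        if step % 1000 == 0:
            print(f"step {step:6d}  train acc {accuracy(train_idx):.3f}  "
                  f"test acc {accuracy(test_idx):.3f}")

In runs like this, train accuracy typically saturates early while test accuracy stays low for a long stretch before improving, which is the delayed-generalization pattern the question is about.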

Do people take advantage of this in practice to get well-generalizing models on small datasets? Do people now train for longer in order to get better models? Or has this not caught on in practice for some reason? …
