Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks. (arXiv:2206.01278v1 [cs.LG]) | allainews.com

June 6, 2022, 1:11 a.m. | Mansheej Paul, Brett W. Larsen, Surya Ganguli, Jonathan Frankle, Gintare Karolina Dziugaite

stat.ML updates on arXiv.org arxiv.org

A striking observation about iterative magnitude pruning (IMP; Frankle et al.
2020) is that $\unicode{x2014}$ after just a few hundred steps of dense
training $\unicode{x2014}$ the method can find a sparse sub-network that can be
trained to the same accuracy as the dense network. However, the same does not
hold at step 0, i.e. random initialization. In this work, we seek to understand
how this early phase of pre-training leads to a good initialization for IMP
both through the lens …

arxiv data diet networks

More from arxiv.org / stat.ML updates on arXiv.org

Non-asymptotic estimates for accelerated high order Langevin Monte Carlo algorithms 2 days, 12 hours ago | arxiv.org

abstract algorithms arxiv convergence +9

Entropic covariance models 3 days, 12 hours ago | arxiv.org

abstract arxiv challenges covariance +12

Bump hunting through density curvature features 3 days, 12 hours ago | arxiv.org

abstract arxiv construct data +18

Uncertainty quantification in metric spaces 3 days, 12 hours ago | arxiv.org

abstract algorithms arxiv datasets +15

Guiding adaptive shrinkage by co-data to improve regression-based prediction and feature selection 3 days, 12 hours ago | arxiv.org

abstract arxiv clinical data +17

A general error analysis for randomized low-rank approximation with application to data assimilation 3 days, 12 hours ago | arxiv.org

abstract algebra algorithms analysis +17

Calabi-Yau Four/Five/Six-folds as $\mathbb{P}^n_\textbf{w}$ Hypersurfaces: Machine Learning, Approximation, and Generation 4 days, 12 hours ago | arxiv.org

abstract approximation arxiv five +17

Bayesian Quantile Regression with Subset Selection: A Posterior Summarization Perspective 4 days, 12 hours ago | arxiv.org

abstract arxiv bayesian distribution +16

The Projected Covariance Measure for assumption-lean variable significance testing 4 days, 12 hours ago | arxiv.org

abstract arxiv covariance lean +14

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net