Rigorous dynamical mean field theory for stochastic gradient descent methods. (arXiv:2210.06591v1 [math-ph]) | allainews.com

Oct. 14, 2022, 1:14 a.m. | Cedric Gerbelot, Emanuele Troiani, Francesca Mignacco, Florent Krzakala, Lenka Zdeborova

stat.ML updates on arXiv.org arxiv.org

We prove closed-form equations for the exact high-dimensional asymptotics of
a family of first order gradient-based methods, learning an estimator (e.g.
M-estimator, shallow neural network, ...) from observations on Gaussian data
with empirical risk minimization. This includes widely used algorithms such as
stochastic gradient descent (SGD) or Nesterov acceleration. The obtained
equations match those resulting from the discretization of dynamical mean-field
theory (DMFT) equations from statistical physics when applied to gradient flow.
Our proof method allows us to give an …

arxiv gradient math mean stochastic theory

More from arxiv.org / stat.ML updates on arXiv.org

Simultaneous upper and lower bounds of American option prices with hedging via neural networks 1 day, 1 hour ago | arxiv.org

abstract arxiv form math.pr +11

Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF 2 days, 1 hour ago | arxiv.org

accounting arxiv context cs.ai +6

Hacking Task Confounder in Meta-Learning 2 days, 1 hour ago | arxiv.org

abstract arxiv cs.lg hacking +12

Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case 2 days, 1 hour ago | arxiv.org

abstract algorithms arxiv case +10

Provable Reward-Agnostic Preference-Based Reinforcement Learning 2 days, 1 hour ago | arxiv.org

abstract agent arxiv cs.ai +16

Mastering Diverse Domains through World Models 2 days, 1 hour ago | arxiv.org

abstract algorithm algorithms application +22

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models 2 days, 1 hour ago | arxiv.org

abstract arxiv cs.it cs.lg +14

Additive Covariance Matrix Models: Modelling Regional Electricity Net-Demand in Great Britain 2 days, 1 hour ago | arxiv.org

abstract arxiv britain consumption +18

Learning Algorithm Generalization Error Bounds via Auxiliary Distributions 2 days, 1 hour ago | arxiv.org

abstract algorithm arxiv cs.it +16

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Analyst

@ SEAKR Engineering | Englewood, CO, United States

View on ai-jobs.net

Data Analyst II

@ Postman | Bengaluru, India

View on ai-jobs.net

Data Architect

@ FORSEVEN | Warwick, GB

View on ai-jobs.net

Director, Data Science

@ Visa | Washington, DC, United States

View on ai-jobs.net

Senior Manager, Data Science - Emerging ML

@ Capital One | McLean, VA

View on ai-jobs.net