Rigorous dynamical mean field theory for stochastic gradient descent methods. (arXiv:2210.06591v1 [math-ph])
Oct. 14, 2022, 1:11 a.m. | Cedric Gerbelot, Emanuele Troiani, Francesca Mignacco, Florent Krzakala, Lenka Zdeborova
cs.LG updates on arXiv.org arxiv.org
We prove closed-form equations for the exact high-dimensional asymptotics of
a family of first-order gradient-based methods, learning an estimator (e.g.
M-estimator, shallow neural network, ...) from observations on Gaussian data
with empirical risk minimization. This includes widely used algorithms such as
stochastic gradient descent (SGD) or Nesterov acceleration. The obtained
equations match those resulting from the discretization of dynamical mean-field
theory (DMFT) equations from statistical physics when applied to gradient flow.
Our proof method allows us to give an …
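The setting the abstract describes — empirical risk minimization on Gaussian data, trained with first-order methods such as SGD or Nesterov acceleration — can be illustrated with a minimal sketch. This is not the paper's DMFT analysis; it only sets up the kind of dynamics the theory characterizes. All dimensions, step sizes, and the ridge-regularized squared loss are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 400, 100                                # samples, dimension
X = rng.standard_normal((n, d)) / np.sqrt(d)   # Gaussian data matrix
w_star = rng.standard_normal(d)                # planted "teacher" vector
y = X @ w_star + 0.1 * rng.standard_normal(n)  # noisy observations
lam = 0.01                                     # ridge penalty

def grad(w, idx):
    """Stochastic gradient of the regularized squared loss on a mini-batch."""
    Xb, yb = X[idx], y[idx]
    return Xb.T @ (Xb @ w - yb) / len(idx) + lam * w

def sgd(steps=2000, lr=0.5, batch=32):
    """Plain mini-batch stochastic gradient descent."""
    w = np.zeros(d)
    for _ in range(steps):
        idx = rng.choice(n, size=batch, replace=False)
        w -= lr * grad(w, idx)
    return w

def nesterov(steps=2000, lr=0.5, momentum=0.9, batch=32):
    """Nesterov-accelerated variant: gradient taken at the look-ahead point."""
    w, v = np.zeros(d), np.zeros(d)
    for _ in range(steps):
        idx = rng.choice(n, size=batch, replace=False)
        v = momentum * v - lr * grad(w + momentum * v, idx)
        w += v
    return w

def risk(w):
    """Empirical risk (mean squared error) of an estimator."""
    return np.mean((X @ w - y) ** 2)

w_sgd, w_nag = sgd(), nesterov()
print(f"SGD risk: {risk(w_sgd):.4f}, Nesterov risk: {risk(w_nag):.4f}")
```

The DMFT equations in the paper describe the exact trajectory of such dynamics (order parameters like the risk above) in the limit where n and d grow proportionally, rather than tracking any single finite-size run.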