Exact Mean Square Linear Stability Analysis for SGD | allainews.com

Feb. 13, 2024, 5:44 a.m. | Rotem Mulayoff, Tomer Michaeli

cs.LG updates on arXiv.org | arxiv.org

The dynamical stability of optimization methods in the vicinity of minima of the loss has recently attracted significant attention. For gradient descent (GD), stable convergence is possible only to minima that are sufficiently flat w.r.t. the step size, and those have been linked with favorable properties of the trained model. However, while the stability threshold of GD is well known, to date, no explicit expression has been derived for the exact threshold of stochastic GD (SGD). In this paper, we derive …
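As a rough illustration of the GD statement above (and only that: the SGD threshold derived in the paper is not given in this excerpt), the sketch below runs plain gradient descent on a one-dimensional quadratic loss f(x) = 0.5 * curvature * x^2. Under that assumption the update is x <- (1 - step_size * curvature) * x, so the iterates shrink only when curvature < 2 / step_size. The function name `run_gd` and all parameter values are hypothetical, chosen purely for the example.

```python
def run_gd(curvature, step_size, x0=1.0, num_steps=100):
    """Run gradient descent on f(x) = 0.5 * curvature * x**2 and return the final iterate."""
    x = x0
    for _ in range(num_steps):
        grad = curvature * x          # gradient of the quadratic loss at x
        x = x - step_size * grad      # plain GD update
    return x


step_size = 0.1                       # stability threshold is 2 / step_size = 20

# Minimum slightly flatter than the threshold: iterates contract toward 0.
flat_result = abs(run_gd(curvature=19.0, step_size=step_size))

# Minimum slightly sharper than the threshold: iterates oscillate and grow.
sharp_result = abs(run_gd(curvature=21.0, step_size=step_size))

print(f"curvature 19 (below 2/step_size): |x| = {flat_result:.3e}")
print(f"curvature 21 (above 2/step_size): |x| = {sharp_result:.3e}")
```

With the same step size, the flatter minimum attracts the iterates while the sharper one repels them, which is the sense in which GD can only converge stably to minima that are flat enough relative to the step size.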

