Faster Convergence of Stochastic Accelerated Gradient Descent under Interpolation

April 4, 2024, 4:42 a.m. | Aaron Mishkin, Mert Pilanci, Mark Schmidt

cs.LG updates on arXiv.org

arXiv:2404.02378v1 Announce Type: cross
Abstract: We prove new convergence rates for a generalized version of stochastic Nesterov acceleration under interpolation conditions. Unlike previous analyses, our approach accelerates any stochastic gradient method which makes sufficient progress in expectation. The proof, which proceeds using the estimating sequences framework, applies to both convex and strongly convex functions and is easily specialized to accelerated SGD under the strong growth condition. In this special case, our analysis reduces the dependence on the strong growth constant …
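
To make the setting in the abstract concrete, here is a minimal sketch of a stochastic Nesterov-style update on a problem where interpolation holds, meaning every per-sample loss has zero gradient at a shared minimizer. It is not the paper's generalized algorithm or its parameter schedule: the problem (a consistent least-squares system with unit-norm rows), the constant-momentum update, and the names `make_interpolating_problem`, `sample_grad`, `stochastic_nesterov`, `eta`, and `beta` are all assumptions chosen for illustration.

```python
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


def make_interpolating_problem(n=20, d=5, seed=0):
    """Build a consistent least-squares problem with b_i = <a_i, w_star>.

    Every f_i(w) = 0.5 * (<a_i, w> - b_i)^2 is minimized at w_star,
    so the interpolation condition holds by construction (assumed setup).
    """
    rng = random.Random(seed)
    w_star = [rng.gauss(0.0, 1.0) for _ in range(d)]
    rows = []
    for _ in range(n):
        a = [rng.gauss(0.0, 1.0) for _ in range(d)]
        norm = dot(a, a) ** 0.5
        rows.append([x / norm for x in a])  # unit-norm rows: each f_i is 1-smooth
    targets = [dot(a, w_star) for a in rows]
    return rows, targets, w_star


def sample_grad(a, b, w):
    """Gradient of the single-sample loss f_i(w) = 0.5 * (<a, w> - b)^2."""
    residual = dot(a, w) - b
    return [residual * x for x in a]


def stochastic_nesterov(rows, targets, steps=200, eta=0.5, beta=0.0, seed=1):
    """Constant-momentum Nesterov update driven by single-sample gradients.

    beta = 0.0 recovers plain SGD. The values of eta and beta are
    illustrative, not the tuned choices analyzed in the paper.
    """
    rng = random.Random(seed)
    d = len(rows[0])
    w_prev = [0.0] * d
    w = [0.0] * d
    for _ in range(steps):
        i = rng.randrange(len(rows))
        # Extrapolate along the previous step, then take a stochastic
        # gradient step from the extrapolated point.
        y = [wk + beta * (wk - wp) for wk, wp in zip(w, w_prev)]
        g = sample_grad(rows[i], targets[i], y)
        w_prev, w = w, [yk - eta * gk for yk, gk in zip(y, g)]
    return w


if __name__ == "__main__":
    rows, targets, w_star = make_interpolating_problem()
    for beta in (0.0, 0.5):
        w = stochastic_nesterov(rows, targets, eta=0.1, beta=beta)
        print(f"beta={beta}: distance to w_star = {dist(w, w_star):.3e}")
```

Because the per-sample gradients vanish at `w_star`, the gradient noise shrinks as the iterates approach the solution, which is what allows constant step sizes in this regime. How large `eta` and `beta` can safely be in the stochastic setting depends on the problem, so the values above should be treated as a starting point for experimentation.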
