April 19, 2024, 4:41 a.m. | Anna Shalova, Andr\'e Schlichting, Mark Peletier

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.12293v1 Announce Type: new
Abstract: We study the limiting dynamics of a large class of noisy gradient descent systems in the overparameterized regime. In this regime the set of global minimizers of the loss is large, and when initialized in a neighbourhood of this zero-loss set a noisy gradient descent algorithm slowly evolves along this set. In some cases this slow evolution has been related to better generalisation properties. We characterize this evolution for the broad class of noisy gradient …

abstract algorithm analysis arxiv class cs.lg dynamics global gradient loss math.pr noise set singular study systems type

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US