Feb. 20, 2024, 5:44 a.m. | Marco Eckhoff, Markus Reiher

cs.LG updates on arXiv.org

arXiv:2307.15663v2 Announce Type: replace
Abstract: The optimization algorithm and its hyperparameters can significantly affect the training speed and resulting model accuracy in machine learning applications. The wish list for an ideal optimizer includes fast and smooth convergence to low error, low computational demand, and general applicability. Our recently introduced continual resilient (CoRe) optimizer has shown superior performance compared to other state-of-the-art first-order gradient-based optimizers for training lifelong machine learning potentials. In this work we provide an extensive performance comparison of …
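The abstract describes benchmarking the CoRe optimizer against other first-order gradient-based optimizers. As the CoRe optimizer is not part of standard deep learning libraries, the following is only a minimal sketch of the kind of fixed-budget optimizer comparison the abstract refers to, using Adam and SGD from PyTorch as stand-ins on a toy regression task; none of it is the authors' code.

```python
# Minimal sketch (not the authors' code): compare final training loss of two
# first-order optimizers under the same step budget on a toy regression task.
import torch


def train(optimizer_cls, steps=500, **opt_kwargs):
    torch.manual_seed(0)
    # Toy data: y = 3x + 1 with a little noise.
    x = torch.linspace(-1, 1, 128).unsqueeze(1)
    y = 3 * x + 1 + 0.05 * torch.randn_like(x)

    model = torch.nn.Sequential(
        torch.nn.Linear(1, 16), torch.nn.Tanh(), torch.nn.Linear(16, 1)
    )
    optimizer = optimizer_cls(model.parameters(), **opt_kwargs)
    loss_fn = torch.nn.MSELoss()

    for _ in range(steps):
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()
    return loss.item()


# Same step budget, same model and data; only the optimizer differs.
print("Adam:", train(torch.optim.Adam, lr=1e-2))
print("SGD :", train(torch.optim.SGD, lr=1e-2, momentum=0.9))
```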

