Nov. 3, 2022, 1:13 a.m. | Mohamed Elsayed, A. Rupam Mahmood

cs.LG updates on arXiv.org arxiv.org

Second-order optimization uses curvature information about the objective
function, which can help in faster convergence. However, such methods typically
require expensive computation of the Hessian matrix, preventing their usage in
a scalable way. The absence of efficient ways of computation drove the most
widely used methods to focus on first-order approximations that do not capture
the curvature information. In this paper, we develop HesScale, a scalable
approach to approximating the diagonal of the Hessian matrix, to incorporate
second-order information in …

arxiv computation scalable

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

AI Scientist/Engineer

@ OKX | Singapore

Research Engineering/ Scientist Associate I

@ The University of Texas at Austin | AUSTIN, TX

Senior Data Engineer

@ Algolia | London, England

Fundamental Equities - Vice President, Equity Quant Research Analyst (Income & Value Investment Team)

@ BlackRock | NY7 - 50 Hudson Yards, New York

Snowflake Data Analytics

@ Devoteam | Madrid, Spain