Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks | allainews.com

Feb. 6, 2024, 5:44 a.m. | Yongchang Hao Yanshuai Cao Lili Mou

cs.LG updates on arXiv.org arxiv.org

Second-order optimization approaches like the generalized Gauss-Newton method are considered more powerful as they utilize the curvature information of the objective function with preconditioning matrices. Albeit offering tempting theoretical benefits, they are not easily applicable to modern deep learning. The major reason is due to the quadratic memory and cubic time complexity to compute the inverse of the matrix. These requirements are infeasible even with state-of-the-art hardware. In this work, we propose Ginger, an eigendecomposition for the inverse of the …

approximation benefits complexity cs.ai cs.lg deep learning function gauss general generalized information linear major math.oc memory modern networks neural networks optimization reason stat.ml

More from arxiv.org / cs.LG updates on arXiv.org

Red-Teaming for Generative AI: Silver Bullet or Security Theater? 1 day, 12 hours ago | arxiv.org

abstract arxiv concerns cs.cy +15

Efficient Data-Driven MPC for Demand Response of Commercial Buildings 1 day, 12 hours ago | arxiv.org

abstract arxiv buildings commercial +20

BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry 1 day, 12 hours ago | arxiv.org

arxiv cs.cv cs.lg diffusion +5

Data-Driven Physics-Informed Neural Networks: A Digital Twin Perspective 1 day, 12 hours ago | arxiv.org

abstract arxiv automated construction +26

Testing the Segment Anything Model on radiology data 1 day, 12 hours ago | arxiv.org

abstract applications arxiv become +20

Robust Point Matching with Distance Profiles 1 day, 12 hours ago | arxiv.org

abstract analyze arxiv cs.lg +13

Cell Maps Representation For Lung Adenocarcinoma Growth Patterns Classification In Whole Slide Images 1 day, 12 hours ago | arxiv.org

abstract arxiv behavior classification +18

Improved Baselines with Visual Instruction Tuning 1 day, 12 hours ago | arxiv.org

abstract academic arxiv clip +25

Calorimeter shower superresolution 1 day, 12 hours ago | arxiv.org

abstract arxiv challenge computational +16

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net