Layer-wise Linear Mode Connectivity | allainews.com

March 20, 2024, 4:43 a.m. | Linara Adilova, Maksym Andriushchenko, Michael Kamp, Asja Fischer, Martin Jaggi

cs.LG updates on arXiv.org arxiv.org

arXiv:2307.06966v3 Announce Type: replace
Abstract: Averaging neural network parameters is an intuitive method for fusing the knowledge of two independent models. It is most prominently used in federated learning. If models are averaged at the end of training, this can only lead to a good performing model if the loss surface of interest is very particular, i.e., the loss in the midpoint between the two models needs to be sufficiently low. This is impossible to guarantee for the non-convex losses …

abstract arxiv connectivity cs.lg federated learning good independent knowledge layer linear loss network neural network parameters surface the end training type wise

More from arxiv.org / cs.LG updates on arXiv.org

DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning 16 hours ago | arxiv.org

abstract agents arxiv benchmark +20

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation 16 hours ago | arxiv.org

abstract ai models arxiv beyond +27

Enabling Accelerators for Graph Computing 16 hours ago | arxiv.org

abstract accelerators applications arxiv +24

DUCK: Distance-based Unlearning via Centroid Kinematics 16 hours ago | arxiv.org

abstract acquired artificial artificial intelligence +16

Motion Informed Needle Segmentation in Ultrasound Images 16 hours ago | arxiv.org

abstract arxiv availability cs.cv +10

A ripple in time: a discontinuity in American history 16 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +13

An algorithm for forensic toolmark comparisons 16 hours ago | arxiv.org

abstract algorithm analysis arxiv +12

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models 16 hours ago | arxiv.org

arxiv characters consistent cs.cv +9

On Linear Separation Capacity of Self-Supervised Representation Learning 16 hours ago | arxiv.org

abstract adept advances arxiv +17

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net