April 5, 2024, 4:41 a.m. | Tom\'as Vergara-Browne, \'Alvaro Soto, Akiko Aizawa

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.03147v1 Announce Type: new
Abstract: We introduce eigenpruning, a method that removes singular values from weight matrices in an LLM to improve its performance in a particular task. This method is inspired by interpretability methods designed to automatically find subnetworks of a model which solve a specific task. In our tests, the pruned model outperforms the original model by a large margin, while only requiring minimal computation to prune the weight matrices. In the case of a small synthetic task …

abstract arxiv cs.ai cs.lg interpretability llm performance singular solve tests type values

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne