March 15, 2024, 4:42 a.m. | Ruiyi Zhang, Rushi Qiang, Sai Ashish Somayajula, Pengtao Xie

cs.LG updates on arXiv.org

arXiv:2403.09113v1 Announce Type: cross
Abstract: Large-scale pretraining followed by task-specific finetuning has achieved great success in various NLP tasks. Since finetuning all parameters of large pretrained models poses substantial computational and memory challenges, several efficient finetuning methods have been developed. Among them, low-rank adaptation (LoRA), which finetunes low-rank incremental update matrices on top of frozen pretrained weights, has proven particularly effective. Nonetheless, LoRA's uniform rank assignment across all layers, along with its reliance on an exhaustive search to find the …
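To make the LoRA idea in the abstract concrete, below is a minimal sketch of a low-rank adapted linear layer in PyTorch: the pretrained weight is frozen, and only a rank-r incremental update (the product of two small matrices) is trained. The class name `LoRALinear`, the default rank and scaling values, and the initialization scheme are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen pretrained linear layer plus a trainable low-rank update (LoRA-style sketch)."""

    def __init__(self, in_features: int, out_features: int, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        # Pretrained weight W0 stays frozen during finetuning.
        self.weight = nn.Parameter(torch.empty(out_features, in_features), requires_grad=False)
        nn.init.normal_(self.weight, std=0.02)
        # Low-rank factors: delta_W = B @ A, with A of shape (rank, in) and B of shape (out, rank).
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Output of the frozen pretrained weight plus the scaled low-rank update.
        base = x @ self.weight.T
        update = (x @ self.lora_A.T) @ self.lora_B.T
        return base + self.scaling * update


# Usage: only the low-rank factors receive gradients.
layer = LoRALinear(768, 768, rank=4)
y = layer(torch.randn(2, 768))
print([n for n, p in layer.named_parameters() if p.requires_grad])  # ['lora_A', 'lora_B']
```

In this sketch the rank (and hence the parameter budget) is a fixed hyperparameter shared by every adapted layer, which is exactly the uniform rank assignment the abstract identifies as a limitation.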

