April 16, 2024, 4:42 a.m. | Yang Lin, Xinyu Ma, Xu Chu, Yujie Jin, Zhibang Yang, Yasha Wang, Hong Mei

cs.LG updates on arXiv.org

arXiv:2404.09610v1 Announce Type: new
Abstract: Parameter-efficient fine-tuning methods, represented by LoRA, play an essential role in adapting large-scale pre-trained models to downstream tasks. However, fine-tuning LoRA-series models also risks overfitting to the training dataset, yet there is still a lack of theoretical guidance and practical mechanisms for controlling overfitting in LoRA-based PEFT methods. In this paper, we propose a LoRA Dropout mechanism for LoRA-based methods by introducing random noise to the learnable low-rank matrices and increasing …
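Based only on the abstract, the core idea is to apply dropout-style random noise to LoRA's learnable low-rank factors during fine-tuning. The sketch below illustrates that general idea in PyTorch; the class name `LoRADropoutLinear` and the parameter `lora_dropout_p` are illustrative assumptions, not the paper's reference implementation, and the exact noise scheme may differ from what the authors propose.

```python
# Minimal sketch, assuming dropout is applied to the low-rank matrices
# themselves (rather than to the layer input, as in vanilla LoRA code).
import torch
import torch.nn as nn


class LoRADropoutLinear(nn.Module):
    """Frozen linear layer with a LoRA update whose low-rank factors
    are randomly perturbed (dropped) during training."""

    def __init__(self, in_features: int, out_features: int,
                 rank: int = 8, alpha: float = 16.0,
                 lora_dropout_p: float = 0.1):
        super().__init__()
        # Pre-trained weight stays frozen.
        self.weight = nn.Parameter(torch.empty(out_features, in_features),
                                   requires_grad=False)
        nn.init.normal_(self.weight, std=0.02)

        # Learnable low-rank matrices A (r x d_in) and B (d_out x r).
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))
        self.scaling = alpha / rank

        # Dropout acting on the low-rank factors (random noise on the
        # learnable matrices); identity at evaluation time.
        self.dropout = nn.Dropout(p=lora_dropout_p)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Randomly zero entries of A and B at each training step.
        A = self.dropout(self.lora_A)
        B = self.dropout(self.lora_B)
        delta_w = B @ A * self.scaling
        return x @ (self.weight + delta_w).T


if __name__ == "__main__":
    layer = LoRADropoutLinear(128, 64, rank=8, lora_dropout_p=0.1)
    out = layer(torch.randn(4, 128))
    print(out.shape)  # torch.Size([4, 64])
```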

