April 1, 2024, 4:42 a.m. | Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara Sainath, Pedro Moreno Mengibar

cs.LG updates on arXiv.org

arXiv:2403.19709v1 Announce Type: cross
Abstract: Parameter-efficient adaptation methods have become a key mechanism for training large pre-trained models on downstream tasks. However, their per-task parameter overhead is still considered high when the number of downstream tasks to adapt to is large. We introduce an adapter module that is more efficient in large-scale multi-task adaptation scenarios. Our adapter is hierarchical in terms of how the adapter parameters are allocated. The adapter consists of a single shared controller network …

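The abstract is truncated, so the exact architecture is not fully specified here. As a rough, hedged illustration of the general idea it describes (a shared controller network generating per-task adapter parameters, so that only a small per-task embedding is stored per task), the sketch below uses PyTorch; all class names, dimensions, and design details are assumptions for illustration, not the paper's method.

```python
import torch
import torch.nn as nn

class SharedControllerAdapter(nn.Module):
    """Illustrative sketch (not the paper's implementation): a shared
    controller generates the down/up projections of a bottleneck adapter
    from a small per-task embedding, so per-task storage is tiny."""

    def __init__(self, d_model=768, bottleneck=32, task_emb_dim=16, num_tasks=100):
        super().__init__()
        # Per-task parameters: only a small embedding per task.
        self.task_embeddings = nn.Embedding(num_tasks, task_emb_dim)
        # Shared controller: maps a task embedding to adapter weights.
        n_weights = d_model * bottleneck * 2  # down- and up-projection
        self.controller = nn.Linear(task_emb_dim, n_weights)
        self.d_model, self.bottleneck = d_model, bottleneck

    def forward(self, x, task_id):
        # x: (batch, seq_len, d_model); task_id: (batch,) long tensor
        emb = self.task_embeddings(task_id)           # (batch, task_emb_dim)
        weights = self.controller(emb)                # (batch, n_weights)
        d, b = self.d_model, self.bottleneck
        w_down = weights[:, : d * b].view(-1, d, b)   # (batch, d_model, bottleneck)
        w_up = weights[:, d * b :].view(-1, b, d)     # (batch, bottleneck, d_model)
        # Bottleneck adapter with a residual connection.
        h = torch.relu(torch.bmm(x, w_down))          # (batch, seq_len, bottleneck)
        return x + torch.bmm(h, w_up)                 # (batch, seq_len, d_model)


# Usage: adapt dummy hidden states of a frozen backbone for task 3.
adapter = SharedControllerAdapter()
hidden = torch.randn(2, 10, 768)
out = adapter(hidden, torch.tensor([3, 3]))
print(out.shape)  # torch.Size([2, 10, 768])
```

In a scheme like this, only the task embeddings (and any other small per-task components the paper may allocate hierarchically) grow with the number of tasks, while the controller is shared across all of them.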
