Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention. (arXiv:2204.03479v1 [cs.CL])
cs.LG updates on arXiv.org
Multi-head self-attention forms the core of Transformer networks. However,
its complexity grows quadratically with the input sequence length, which
impedes deployment on resource-constrained edge devices. We address this
challenge by proposing a dynamic pruning method that exploits the temporal
stability of data across tokens to reduce inference cost. The threshold-based
method retains only significant differences between subsequent tokens,
effectively reducing the number of multiply-accumulate operations as well as
the internal tensor data sizes. The approach is evaluated on …
arxiv attention delta edge head self-attention transformer transformers
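To make the delta idea concrete, here is a minimal sketch of threshold-based delta pruning for a single linear projection (e.g. the query projection inside self-attention), written against the abstract only: the threshold value, shapes, and function names are illustrative assumptions, not taken from the paper. Each token is projected by updating the previous output with only the input deltas whose magnitude crosses the threshold, so the multiply-accumulates for unchanged features are skipped.

```python
# Sketch of threshold-based delta pruning across a token stream.
# Assumptions (not from the paper): single linear projection, fixed
# scalar threshold, per-feature "committed" reference state.
import numpy as np

rng = np.random.default_rng(0)

d_model, d_head = 64, 64
W = rng.standard_normal((d_model, d_head)) / np.sqrt(d_model)  # projection weights

threshold = 0.05           # prune deltas with magnitude below this value
x_ref = np.zeros(d_model)  # last committed input state, per feature
y_ref = np.zeros(d_head)   # output corresponding to the committed state

def project_delta(x_t):
    """Project token x_t by updating the previous output with only the
    significant input deltas, skipping MACs for the remaining features."""
    global x_ref, y_ref
    delta = x_t - x_ref
    keep = np.abs(delta) >= threshold            # mask of significant features
    # Only kept features contribute MACs: y_t = y_ref + delta[keep] @ W[keep]
    y_t = y_ref + delta[keep] @ W[keep]
    # Commit only the kept features, so small changes accumulate until they
    # eventually cross the threshold instead of being dropped forever.
    x_ref = np.where(keep, x_t, x_ref)
    y_ref = y_t
    return y_t, int(keep.sum())

# Feed a slowly varying (temporally stable) token sequence through it.
tokens = np.cumsum(0.02 * rng.standard_normal((16, d_model)), axis=0)
for t, x_t in enumerate(tokens):
    y_t, n_macs = project_delta(x_t)
    dense = x_t @ W
    print(f"t={t:2d}  kept {n_macs:2d}/{d_model} features, "
          f"max abs error vs dense = {np.max(np.abs(y_t - dense)):.4f}")
```

Committing only the features that crossed the threshold keeps the approximation error bounded by the threshold per feature; how the actual method propagates these deltas through the full multi-head attention block is detailed in the paper itself.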