May 23, 2022, 1:12 a.m. | Arda Sahiner, Tolga Ergen, Batu Ozturkler, John Pauly, Morteza Mardani, Mert Pilanci

cs.CV updates on arXiv.org arxiv.org

Vision transformers using self-attention or its proposed alternatives have
demonstrated promising results in many image-related tasks. However, the
underlying inductive bias of attention is not well understood. To address
this issue, this paper analyzes attention through the lens of convex duality.
For non-linear dot-product self-attention, as well as alternative mechanisms such
as the MLP-Mixer and the Fourier Neural Operator (FNO), we derive equivalent
finite-dimensional convex problems that are interpretable and solvable to
global optimality. The convex programs lead to block nuclear-norm …

analysis arxiv attention transformers vision
