Dissecting Query-Key Interaction in Vision Transformers
May 27, 2024, 4:46 a.m. | Xu Pan, Aaron Philip, Ziqian Xie, Odelia Schwartz
cs.CV updates on arXiv.org (arxiv.org)
Abstract: Self-attention in vision transformers has been thought to perform perceptual grouping where tokens attend to other tokens with similar embeddings, which could correspond to semantically similar features in an image. However, contextualization is also an important and necessary computation for processing signals. Contextualization potentially requires tokens to attend to dissimilar tokens such as those corresponding to backgrounds or different objects, but this effect has not been reported in previous studies. In this study, we investigate …
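The query-key interaction the abstract refers to is the standard scaled dot-product attention, where each token's query is compared against every token's key and the resulting similarity scores are softmaxed into attention weights. A minimal NumPy sketch (not the paper's method; the toy embeddings and the identity query/key projections are illustrative assumptions) shows the "perceptual grouping" reading, in which a token attends most to tokens with similar embeddings:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Standard self-attention: each query is dotted with every key,
    scaled by sqrt(d_k), and softmaxed into attention weights."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # (n_tokens, n_tokens)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over keys
    return weights @ V, weights

# Toy example: tokens 0 and 1 have similar embeddings, token 2 is dissimilar.
x = np.array([[1.0, 0.0],
              [0.9, 0.1],
              [0.0, 1.0]])

# With identity query/key projections (an assumption for illustration),
# attention is driven purely by embedding similarity.
out, w = scaled_dot_product_attention(x, x, x)
# Row 0 of w puts more weight on token 1 (similar) than token 2 (dissimilar).
```

In this grouping regime the attention matrix concentrates mass on similar tokens; the abstract's point is that contextualization would instead require weight on dissimilar tokens, such as background regions or other objects.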