Uniformer: Unified Transformer for Efficient Spatiotemporal Representation Learning. (arXiv:2201.04676v1 [cs.CV]) | allainews.com

Jan. 14, 2022, 2:10 a.m. | Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

cs.CV updates on arXiv.org arxiv.org

It is a challenging task to learn rich and multi-scale spatiotemporal
semantics from high-dimensional videos, due to large local redundancy and
complex global dependency between video frames. The recent advances in this
research have been mainly driven by 3D convolutional neural networks and vision
transformers. Although 3D convolution can efficiently aggregate local context
to suppress local redundancy from a small 3D neighborhood, it lacks the
capability to capture global dependency because of the limited receptive field.
Alternatively, vision transformers can …

arxiv cv learning transformer

More from arxiv.org / cs.CV updates on arXiv.org

Attention-Map Augmentation for Hypercomplex Breast Cancer Classification 1 day, 6 hours ago | arxiv.org

arxiv attention augmentation cancer +5

Hidden Flaws Behind Expert-Level Accuracy of GPT-4 Vision in Medicine 1 day, 6 hours ago | arxiv.org

abstract accuracy analysis arxiv +26

A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook 1 day, 6 hours ago | arxiv.org

abstract advances algorithms annotation +20

Towards Effective Multi-Moving-Camera Tracking: A New Dataset and Lightweight Link Model 1 day, 6 hours ago | arxiv.org

arxiv cs.cv dataset moving +2

Holodeck: Language Guided Generation of 3D Embodied AI Environments 1 day, 6 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +12

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance 1 day, 6 hours ago | arxiv.org

3d object 3d object detection arxiv cs.cv +6

Fine-tuning vision foundation model for crack segmentation in civil infrastructures 1 day, 6 hours ago | arxiv.org

abstract adapter ai models arxiv +15

FG-MDM: Towards Zero-Shot Human Motion Generation via Fine-Grained Descriptions 1 day, 6 hours ago | arxiv.org

abstract arxiv beyond cs.cv +16

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model 1 day, 6 hours ago | arxiv.org

abstract adapter arxiv control +20

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Alternant Data Engineering

@ Aspire Software | Angers, FR

View on ai-jobs.net

Senior Software Engineer, Generative AI

@ Google | Dublin, Ireland

View on ai-jobs.net