ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation. (arXiv:2211.13189v1 [cs.SD]) | allainews.com

Nov. 24, 2022, 7:17 a.m. | Sara Atito, Muhammad Awais, Wenwu Wang, Mark D Plumbley, Josef Kittler

cs.CV updates on arXiv.org arxiv.org

Vision transformers, which were originally developed for natural language
processing, have recently generated significant interest in the computer vision
and audio communities due to their flexibility in learning long-range
relationships. Constrained by data hungry nature of transformers and limited
labelled data most transformer-based models for audio tasks are finetuned from
ImageNet pretrained models, despite the huge gap between the natural images
domain and audio domain. This has motivated the research in self-supervised
pretraining of audio transformers, which reduces the dependency …

arxiv audio general representation spectrogram transformer vision

More from arxiv.org / cs.CV updates on arXiv.org

One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts 13 hours ago | arxiv.org

abstract arxiv building construction +18

Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation 13 hours ago | arxiv.org

abstract applications arxiv automation +15

Morphing Tokens Draw Strong Masked Image Models 13 hours ago | arxiv.org

arxiv cs.cv image tokens +1

Compact 3D Scene Representation via Self-Organizing Gaussian Grids 13 hours ago | arxiv.org

arxiv compact cs.cv representation +2

Fingerprint Matching with Localized Deep Representation 13 hours ago | arxiv.org

abstract accuracy acquisition arxiv +8

A Survey on Transferability of Adversarial Examples across Deep Neural Networks 13 hours ago | arxiv.org

abstract adversarial adversarial examples arxiv +27

Content Bias in Deep Learning Image Age Approximation: A new Approach Towards better Explainability 13 hours ago | arxiv.org

abstract age approximation arxiv +15

Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling 13 hours ago | arxiv.org

arxiv assessment consistent continual +6

DA-RAW: Domain Adaptive Object Detection for Real-World Adverse Weather Conditions 13 hours ago | arxiv.org

abstract arxiv cs.cv cs.ro +17

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Data Engineer (m/f/d)

@ Project A Ventures | Berlin, Germany

View on ai-jobs.net

Principle Research Scientist

@ Analog Devices | US, MA, Boston

View on ai-jobs.net