DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition | allainews.com

April 24, 2024, 4:45 a.m. | Haozhe Cheng, Cheng Ju, Haicheng Wang, Jinxiang Liu, Mengting Chen, Qiang Hu, Xiaoyun Zhang, Yanfeng Wang

cs.CV updates on arXiv.org arxiv.org

arXiv:2404.14890v1 Announce Type: new
Abstract: As one of the fundamental video tasks in computer vision, Open-Vocabulary Action Recognition (OVAR) recently gains increasing attention, with the development of vision-language pre-trainings. To enable generalization of arbitrary classes, existing methods treat class labels as text descriptions, then formulate OVAR as evaluating embedding similarity between visual samples and textual classes. However, one crucial issue is completely ignored: the class descriptions given by users may be noisy, e.g., misspellings and typos, limiting the real-world practicality …

abstract action recognition arxiv attention class computer computer vision cs.cv development embedding fundamental labels language recognition robustness tasks text type video vision vision-language visual

More from arxiv.org / cs.CV updates on arXiv.org

One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts 1 day, 2 hours ago | arxiv.org

abstract arxiv building construction +18

Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation 1 day, 2 hours ago | arxiv.org

abstract applications arxiv automation +15

Morphing Tokens Draw Strong Masked Image Models 1 day, 2 hours ago | arxiv.org

arxiv cs.cv image tokens +1

Compact 3D Scene Representation via Self-Organizing Gaussian Grids 1 day, 2 hours ago | arxiv.org

arxiv compact cs.cv representation +2

Fingerprint Matching with Localized Deep Representation 1 day, 2 hours ago | arxiv.org

abstract accuracy acquisition arxiv +8

A Survey on Transferability of Adversarial Examples across Deep Neural Networks 1 day, 2 hours ago | arxiv.org

abstract adversarial adversarial examples arxiv +27

Content Bias in Deep Learning Image Age Approximation: A new Approach Towards better Explainability 1 day, 2 hours ago | arxiv.org

abstract age approximation arxiv +15

Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling 1 day, 2 hours ago | arxiv.org

arxiv assessment consistent continual +6

DA-RAW: Domain Adaptive Object Detection for Real-World Adverse Weather Conditions 1 day, 2 hours ago | arxiv.org

abstract arxiv cs.cv cs.ro +17

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net