Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization | allainews.com

March 6, 2024, 5:45 a.m. | Yuxin Guo, Shijie Ma, Yuhao Zhao, Hu Su, Wei Zou

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.03095v1 Announce Type: new
Abstract: Audio-Visual Source Localization (AVSL) is the task of identifying specific sounding objects in the scene given audio cues. In our work, we focus on semi-supervised AVSL with pseudo-labeling. To address the issues with vanilla hard pseudo-labels including bias accumulation, noise sensitivity, and instability, we propose a novel method named Cross Pseudo-Labeling (XPL), wherein two models learn from each other with the cross-refine mechanism to avoid bias accumulation. We equip XPL with two effective components. Firstly, …

abstract arxiv audio bias cs.cv cs.mm cs.sd eess.as focus labeling labels localization noise novel objects semi-supervised sensitivity type visual work

More from arxiv.org / cs.CV updates on arXiv.org

Having Second Thoughts? Let's hear it 2 hours ago | arxiv.org

abstract arxiv brain cognitive +20

Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation 2 hours ago | arxiv.org

abstract arxiv attention cs.cv +15

Decoupling Dynamic Monocular Videos for Dynamic View Synthesis 2 hours ago | arxiv.org

abstract arxiv challenge cs.cv +13

From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets 2 hours ago | arxiv.org

abstract accuracy arxiv cnns +20

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation 2 hours ago | arxiv.org

arxiv cs.cv cs.ro domain +10

Self-supervised Feature-Gate Coupling for Dynamic Network Pruning 2 hours ago | arxiv.org

abstract arxiv computational cost +16

An Organic Weed Control Prototype using Directed Energy and Deep Learning 2 hours ago | arxiv.org

abstract array arxiv control +15

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet 2 hours ago | arxiv.org

abstract arxiv attention attention mechanisms +20

Generative Adversarial Networks in Ultrasound Imaging: Extending Field of View Beyond Conventional Limits 2 hours ago | arxiv.org

abstract adversarial arxiv beyond +18

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Senior Applied Data Scientist

@ dunnhumby | London

View on ai-jobs.net

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

View on ai-jobs.net