May 8, 2023, 12:47 a.m. | Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat

cs.CV updates on arXiv.org arxiv.org

Most deepfake detection methods focus on detecting spatial and/or
spatio-temporal changes in facial attributes. This is because available
benchmark datasets contain mostly visual-only modifications. However, a
sophisticated deepfake may include small segments of audio or audio-visual
manipulations that can completely change the meaning of the content. To
addresses this gap, we propose and benchmark a new dataset, Localized Audio
Visual DeepFake (LAV-DF), consisting of strategic content-driven audio, visual
and audio-visual manipulations. The proposed baseline method, Boundary Aware
Temporal Forgery Detection …

arxiv audio benchmark change datasets deepfake detection focus glitch localization matrix scale small temporal the matrix

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne