Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features. (arXiv:2208.01214v1 [cs.SD])
cs.LG updates on arXiv.org arxiv.org
Recently, pioneering studies have proposed many acoustic features for audio
deepfake detection (log power spectrogram, linear frequency cepstral
coefficients, constant Q cepstral coefficients, etc.). These features achieve
good performance, and the studies show that different subbands contribute
differently to detection. However, these studies do not explain what specific
information each subband carries, and the features also discard information
such as phase. Inspired by the mechanism of synthetic speech, the
fundamental frequency (F0) information is used to improve …
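
The remark that magnitude-based features lose phase can be illustrated with a minimal sketch. This is not the paper's pipeline: the frame length, hop size, toy sine signals, and every function name below are assumptions chosen for illustration, and a naive DFT stands in for a real STFT library. It compares two tones that differ only in starting phase, once through a log power spectrogram and once through separate real and imaginary spectrograms.

```python
import cmath
import math


def dft_frame(frame):
    """Naive DFT of one real-valued frame; keeps the non-negative-frequency bins."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n // 2 + 1)
    ]


def stft(signal, frame_len=64, hop=32):
    """Hann-windowed short-time Fourier transform as a list of complex frames."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / frame_len)
              for t in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + t] * window[t] for t in range(frame_len)]
        frames.append(dft_frame(frame))
    return frames


def log_power(spec, eps=1e-10):
    """Log power spectrogram: uses only |X|^2, so phase is discarded."""
    return [[math.log(abs(c) ** 2 + eps) for c in frame] for frame in spec]


def real_imag(spec):
    """Real and imaginary spectrograms: together they keep magnitude and phase."""
    real = [[c.real for c in frame] for frame in spec]
    imag = [[c.imag for c in frame] for frame in spec]
    return real, imag


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two 2-D lists."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Two toy signals (assumed for illustration): same frequency, different phase.
sr = 1024  # hypothetical sample rate in Hz
sig_a = [math.sin(2 * math.pi * 128 * t / sr) for t in range(256)]
sig_b = [math.sin(2 * math.pi * 128 * t / sr + math.pi / 2) for t in range(256)]

spec_a, spec_b = stft(sig_a), stft(sig_b)

# Log power features of the two signals should be nearly indistinguishable.
lp_diff = max_abs_diff(log_power(spec_a), log_power(spec_b))

# The real-part spectrograms should differ clearly, since they carry phase.
re_a, im_a = real_imag(spec_a)
re_b, im_b = real_imag(spec_b)
re_diff = max_abs_diff(re_a, re_b)

print("max log-power difference:", lp_diff)
print("max real-part difference:", re_diff)
```

In this toy setting the log power features cannot separate the two signals, while the real and imaginary parts can, which is one way to read the motivation for a real plus imaginary spectrogram feature.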