Web: http://arxiv.org/abs/2201.12039

Jan. 31, 2022, 2:11 a.m. | Kishan Gupta, Srikanth Korse, Bernd Edler, Guillaume Fuchs

cs.LG updates on arXiv.org

Frequency-domain processing, and in particular the use of the Modified Discrete
Cosine Transform (MDCT), is the most widespread approach to audio coding.
However, at low bitrates, audio quality, especially for speech, degrades
drastically due to the lack of available bits to directly code the transform
coefficients. Traditionally, post-filtering has been used to mitigate artefacts
in the coded speech by exploiting a priori information about the source and extra
transmitted parameters. Recently, data-driven post-filters have shown better
results, but at the cost …


