Web: http://arxiv.org/abs/2201.12039

Jan. 31, 2022, 2:11 a.m. | Kishan Gupta, Srikanth Korse, Bernd Edler, Guillaume Fuchs

cs.LG updates on arXiv.org arxiv.org

Frequency domain processing, and in particular the use of Modified Discrete
Cosine Transform (MDCT), is the most widespread approach to audio coding.
However, at low bitrates, audio quality, especially for speech, degrades
drastically due to the lack of available bits to directly code the transform
coefficients. Traditionally, post-filtering has been used to mitigate artefacts
in the coded speech by exploiting a-priori information of the source and extra
transmitted parameters. Recently, data-driven post-filters have shown better
results, but at the cost …

arxiv speech

More from arxiv.org / cs.LG updates on arXiv.org

Data Engineer, Buy with Prime

@ Amazon.com | Santa Monica, California, USA

Data Architect – Public Sector Health Data Architect, WWPS

@ Amazon.com | US, VA, Virtual Location - Virginia

[Job 8224] Data Engineer - Developer Senior

@ CI&T | Brazil

Software Engineer, Machine Learning, Planner/Behavior Prediction

@ Nuro, Inc. | Mountain View, California (HQ)

Lead Data Scientist

@ Inspectorio | Ho Chi Minh City, Ho Chi Minh City, Vietnam - Remote

Data Engineer

@ Craftable | Portugal - Remote