Aug. 29, 2022, 1:11 a.m. | Bruno Di Giorgi, Mark Levy, Richard Sharp

cs.LG updates on arXiv.org arxiv.org

Vocoders are models capable of transforming a low-dimensional spectral
representation of an audio signal, typically the mel spectrogram, to a
waveform. Modern speech generation pipelines use a vocoder as their final
component. Recent vocoder models developed for speech achieve a high degree of
realism, such that it is natural to wonder how they would perform on music
signals. Compared to speech, the heterogeneity and structure of the musical
sound texture offers new challenges. In this work we focus on one …

arxiv spectrogram

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Lead Data Scientist, Commercial Analytics

@ Checkout.com | London, United Kingdom

Data Engineer I

@ Love's Travel Stops | Oklahoma City, OK, US, 73120