all AI news
Mel Spectrogram Inversion with Stable Pitch. (arXiv:2208.12782v1 [cs.SD])
Aug. 29, 2022, 1:11 a.m. | Bruno Di Giorgi, Mark Levy, Richard Sharp
cs.LG updates on arXiv.org arxiv.org
Vocoders are models capable of transforming a low-dimensional spectral
representation of an audio signal, typically the mel spectrogram, to a
waveform. Modern speech generation pipelines use a vocoder as their final
component. Recent vocoder models developed for speech achieve a high degree of
realism, such that it is natural to wonder how they would perform on music
signals. Compared to speech, the heterogeneity and structure of the musical
sound texture offers new challenges. In this work we focus on one …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Lead Data Scientist, Commercial Analytics
@ Checkout.com | London, United Kingdom
Data Engineer I
@ Love's Travel Stops | Oklahoma City, OK, US, 73120