all AI news
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder. (arXiv:2210.15533v1 [cs.SD])
Oct. 28, 2022, 1:11 a.m. | Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda
cs.LG updates on arXiv.org arxiv.org
Our previous work, the unified source-filter GAN (uSFGAN) vocoder, introduced
a novel architecture based on the source-filter theory into the parallel
waveform generative adversarial network to achieve high voice quality and pitch
controllability. However, the high temporal resolution inputs result in high
computation costs. Although the HiFi-GAN vocoder achieves fast high-fidelity
voice generation thanks to the efficient upsampling-based generator
architecture, the pitch controllability is severely limited. To realize a fast
and pitch-controllable high-fidelity neural vocoder, we introduce the
source-filter theory …
More from arxiv.org / cs.LG updates on arXiv.org
The Perception-Robustness Tradeoff in Deterministic Image Restoration
2 days, 2 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne