all AI news
[D] Understanding audio sampling function for a speech synthesis WGAN
May 17, 2022, 12:05 p.m. | /u/ShujiMikami
Machine Learning www.reddit.com
I've recently made a [post](https://www.reddit.com/r/MachineLearning/comments/ul8igh/d_is_wgangp_gradient_penalty_applicable_to_the/) on this subreddit asking for clarifications on a certain paper describing an implementation of WGAN-GP for speech synthesis from silent videos. Those answers were all really helpful in better understanding the learning process, however more have cropped up as I began digging deeper.
I'm currently attempting training a hybrid model between the architectures described in [these](https://arxiv.org/pdf/1906.06301.pdf) [two](https://arxiv.org/pdf/2104.13332.pdf) papers, with a generator and objective function from the former and the critic and PASE blocks from …
audio function machinelearning sampling speech understanding wgan
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)
@ Palo Alto Networks | Santa Clara, CA, United States
Consultant Senior Data Engineer F/H
@ Devoteam | Nantes, France