May 17, 2022, 12:05 p.m. | /u/ShujiMikami

Machine Learning www.reddit.com

Hello,

I've recently made a [post](https://www.reddit.com/r/MachineLearning/comments/ul8igh/d_is_wgangp_gradient_penalty_applicable_to_the/) on this subreddit asking for clarifications on a certain paper describing an implementation of WGAN-GP for speech synthesis from silent videos. Those answers were all really helpful in better understanding the learning process, however more have cropped up as I began digging deeper.

I'm currently attempting training a hybrid model between the architectures described in [these](https://arxiv.org/pdf/1906.06301.pdf) [two](https://arxiv.org/pdf/2104.13332.pdf) papers, with a generator and objective function from the former and the critic and PASE blocks from …

audio function machinelearning sampling speech understanding wgan

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)

@ Palo Alto Networks | Santa Clara, CA, United States

Consultant Senior Data Engineer F/H

@ Devoteam | Nantes, France