Nov. 7, 2022, 4:59 a.m. | /u/aerialbits

Machine Learning www.reddit.com

Most of the ones that I've seen are only text to speech. Would be great if there were one that took into account the source voice's inflections, pacing, etc.

So far I've only been able to find [StarGANv2](https://github.com/yl4579/StarGANv2-VC). Which [one redditor](https://www.reddit.com/r/MachineLearning/comments/xn046x/comment/iqp1iea/?utm_source=share&utm_medium=web2x&context=3) used to create [this](https://www.youtube.com/watch?v=cEaG2LYFAFA). Is this the best there is or are there better alternatives?

Thanks!

EDIT:

Digging a bit deeper, I found this project called [IMS-Toucan](https://github.com/DigitalPhonetics/IMS-Toucan) which has several very impressive demos on huggingface.

deep fake fake machinelearning project speech voice

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York