Nov. 11, 2023, 5:58 a.m. | Essam Wisam

Towards Data Science - Medium towardsdatascience.com

Learn about text-to-speech and how it’s realized by transformers

In 2019, FastSpeech has pushed the frontier of neural text-to-speech by offering significant improvement in inference speed while maintaining robustness to prevent word repetition or omission. It also allowed for controllability of the output speech in terms of speech and prosody.

In this story, we aim to familiarize you with how transformers are employed for text-to-speech, provide you with a concise overview of the FastSpeech paper and point you to how …

aim deep learning implementation improvement inference learn nlp overview paper robustness speech speed story terms text text-to-speech thoughts-and-theory transformers word

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

GN SONG MT Market Research Data Analyst 11

@ Accenture | Bengaluru, BDC7A