StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis (arXiv:2205.15439v1 [eess.AS])

June 1, 2022, 1:12 a.m. | Yinghao Aaron Li, Cong Han, Nima Mesgarani

cs.CL updates on arXiv.org | arxiv.org

Text-to-Speech (TTS) has recently seen great progress in synthesizing
high-quality speech owing to the rapid development of parallel TTS systems, but
producing speech with naturalistic prosodic variations, speaking styles and
emotional tones remains challenging. Moreover, since duration and speech are
generated separately, parallel TTS models still have problems finding the best
monotonic alignments that are crucial for naturalistic speech synthesis. Here,
we propose StyleTTS, a style-based generative model for parallel TTS that can
synthesize diverse speech with natural prosody from …


