ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. (arXiv:2202.07816v1 [eess.AS]) | allainews.com

Feb. 17, 2022, 8:10 a.m. | Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao

cs.CL updates on arXiv.org arxiv.org

Expressive text-to-speech (TTS) has become a hot research topic recently,
mainly focusing on modeling prosody in speech. Prosody modeling has several
challenges: 1) the extracted pitch used in previous prosody modeling works have
inevitable errors, which hurts the prosody modeling; 2) different attributes of
prosody (e.g., pitch, duration and energy) are dependent on each other and
produce the natural prosody together; and 3) due to high variability of prosody
and the limited amount of high-quality data for TTS training, the …

arxiv speech text text-to-speech training

More from arxiv.org / cs.CL updates on arXiv.org

A Text Classification Framework for Simple and Effective Early Depression Detection Over Social Media Streams 22 hours ago | arxiv.org

abstract arxiv build classification +22

A Survey on Prompting Techniques in LLMs 22 hours ago | arxiv.org

abstract arxiv autoregressive cs.ai +24

Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis 22 hours ago | arxiv.org

abstract arxiv conversation cs.cl +21

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks 22 hours ago | arxiv.org

arxiv code code generation cs.ai +9

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation 22 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +16

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 22 hours ago | arxiv.org

abstract arxiv case case study +19

Formal Aspects of Language Modeling 22 hours ago | arxiv.org

abstract artificial artificial intelligence arxiv +24

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model 22 hours ago | arxiv.org

abstract advanced arxiv challenges +24

Predicting Emergent Abilities with Infinite Resolution Evaluation 22 hours ago | arxiv.org

abstract arxiv cs.cl evaluation +19

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Enterprise Data Architect

@ Pathward | Remote

View on ai-jobs.net

Diagnostic Imaging Information Systems (DIIS) Technologist

@ Nova Scotia Health Authority | Halifax, NS, CA, B3K 6R8

View on ai-jobs.net

Intern Data Scientist - Residual Value Risk Management (f/m/d)

@ BMW Group | Munich, DE

View on ai-jobs.net

Analytics Engineering Manager

@ PlayStation Global | United Kingdom, London

View on ai-jobs.net

Junior Insight Analyst (PR&Comms)

@ Signal AI | Lisbon, Lisbon, Portugal

View on ai-jobs.net