GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech. (arXiv:2205.07211v2 [eess.AS] UPDATED)

Oct. 13, 2022, 1:18 a.m. | Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao

cs.CL updates on arXiv.org

Style transfer for out-of-domain (OOD) speech synthesis aims to generate
speech samples with unseen style (e.g., speaker identity, emotion, and prosody)
derived from an acoustic reference. It faces two challenges: 1) the highly
dynamic style features in expressive voice are difficult to model and
transfer; and 2) TTS models must be robust enough to handle diverse OOD
conditions that differ from the source data. This paper proposes
GenerSpeech, a text-to-speech model towards high-fidelity zero-shot style
transfer of OOD …
