July 14, 2022, 1:12 a.m. | Yookyung Shin, Younggun Lee, Suhee Jo, Yeongtae Hwang, Taesu Kim

cs.CL updates on arXiv.org arxiv.org

Expressive text-to-speech has shown improved performance in recent years.
However, the style control of synthetic speech is often restricted to discrete
emotion categories and requires training data recorded by the target speaker in
the target style. In many practical situations, users may not have reference
speech recorded in target emotion but still be interested in controlling speech
style just by typing text description of desired emotional style. In this
paper, we propose a text-based interface for emotional style control and …

arxiv style transfer text transfer tts

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Machine Learning Engineer (m/f/d)

@ StepStone Group | Düsseldorf, Germany

2024 GDIA AI/ML Scientist - Supplemental

@ Ford Motor Company | United States