Aug. 11, 2022, 1:11 a.m. | Xiang Li, Changhe Song, Xianhao Wei, Zhiyong Wu, Jia Jia, Helen Meng

cs.CL updates on arXiv.org arxiv.org

Cross-speaker style transfer aims to extract the speech style of the given
reference speech, which can be reproduced in the timbre of arbitrary target
speakers. Existing methods on this topic have explored utilizing
utterance-level style labels to perform style transfer via either global or
local scale style representations. However, audiobook datasets are typically
characterized by both the local prosody and global genre, and are rarely
accompanied by utterance-level style labels. Thus, properly transferring the
reading style across different speakers remains …

arxiv dataset reading style transfer transfer

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Computer Vision Engineer

@ Motive | Pakistan - Remote

Data Analyst III

@ Fanatics | New York City, United States

Senior Data Scientist - Experian Health (This role is remote, from anywhere in the U.S.)

@ Experian | ., ., United States

Senior Data Engineer

@ Springer Nature Group | Pune, IN