July 21, 2022, 1:11 a.m. | Nathaniel Robinson, Perez Ogayo, Swetha Gangu, David R. Mortensen, Shinji Watanabe

cs.CL updates on arXiv.org

Developing Automatic Speech Recognition (ASR) for low-resource languages is a
challenge due to the small amount of transcribed audio data. For many such
languages, audio and text are available separately, but not audio with
transcriptions. Speech can be synthesized from text using
text-to-speech (TTS) systems. However, many low-resource languages do not have
quality TTS systems either. We propose an alternative: produce synthetic audio
by running text from the target language through a trained TTS system for a
higher-resource pivot …
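
As a rough illustration of the idea in the abstract, the sketch below pairs each target-language sentence with audio produced by a pivot-language TTS system, giving synthetic (audio, transcript) pairs that could supplement ASR training data. It is not the authors' implementation: the function name `augment_with_pivot_tts`, the `Waveform` type, and the stand-in `dummy_pivot_tts` are all hypothetical, and a real setup would pass in an actual trained TTS model for the pivot language.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical audio representation: a flat list of samples.
Waveform = List[float]


def augment_with_pivot_tts(
    target_sentences: Iterable[str],
    pivot_tts: Callable[[str], Waveform],
) -> List[Tuple[Waveform, str]]:
    """Build synthetic (audio, transcript) pairs for the target language.

    `pivot_tts` is assumed to be a TTS system trained on a higher-resource
    pivot language; it is fed target-language text unchanged.
    """
    pairs: List[Tuple[Waveform, str]] = []
    for sentence in target_sentences:
        text = sentence.strip()
        if not text:
            continue  # skip empty lines in the text corpus
        audio = pivot_tts(text)
        pairs.append((audio, text))
    return pairs


def dummy_pivot_tts(text: str) -> Waveform:
    """Placeholder for a real pivot-language TTS model.

    Returns one silent sample per character; this is not real speech.
    """
    return [0.0] * len(text)


# Arbitrary placeholder strings standing in for target-language text.
corpus = ["first target sentence", "   ", "second target sentence"]
synthetic = augment_with_pivot_tts(corpus, dummy_pivot_tts)
```

Keeping the TTS system as a plain callable means the same loop works with any pivot-language synthesizer; how well the resulting audio helps ASR depends on details the truncated abstract does not give.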

Tags: arxiv, augmentation, language, pivot, tts
