Feb. 19, 2024, 5:43 a.m. | Tatiana Likhomanenko, Loren Lugosch, Ronan Collobert

cs.LG updates on arXiv.org arxiv.org

arXiv:2305.13330v3 Announce Type: replace-cross
Abstract: Recent work has shown that it is possible to train an $\textit{unsupervised}$ automatic speech recognition (ASR) system using only unpaired audio and text. Existing unsupervised ASR methods assume that no labeled data can be used for training. We argue that even if one does not have any labeled audio for a given language, there is $\textit{always}$ labeled data available for other languages. We show that it is possible to use character-level acoustic models (AMs) from …

abstract arxiv asr audio automatic speech recognition cross-lingual cs.cl cs.lg cs.sd data eess.as labeling recognition speech speech recognition text train training type unsupervised via work

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Scientist

@ ITE Management | New York City, United States