Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Feb. 19, 2024, 5:43 a.m. | Tatiana Likhomanenko, Loren Lugosch, Ronan Collobert
cs.LG updates on arXiv.org
Abstract: Recent work has shown that it is possible to train an $\textit{unsupervised}$ automatic speech recognition (ASR) system using only unpaired audio and text. Existing unsupervised ASR methods assume that no labeled data can be used for training. We argue that even if one does not have any labeled audio for a given language, there is $\textit{always}$ labeled data available for other languages. We show that it is possible to use character-level acoustic models (AMs) from …