all AI news
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment. (arXiv:2203.15937v2 [eess.AS] UPDATED)
April 8, 2022, 1:12 a.m. | Mu Yang, Kevin Hirschi, Stephen D. Looney, Okim Kang, John H. L. Hansen
cs.LG updates on arXiv.org arxiv.org
Current leading mispronunciation detection and diagnosis (MDD) systems
achieve promising performance via end-to-end phoneme recognition. One challenge
of such end-to-end solutions is the scarcity of human-annotated phonemes on
natural L2 speech. In this work, we leverage unlabeled L2 speech via a
pseudo-labeling (PL) procedure and extend the fine-tuning approach based on
pre-trained self-supervised learning (SSL) models. Specifically, we use Wav2vec
2.0 as our SSL model, and fine-tune it using original labeled L2 speech samples
plus the created pseudo-labeled L2 speech …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Data Engineer
@ Publicis Groupe | New York City, United States
Associate Principal Robotics Engineer - Research.
@ Dyson | United Kingdom - Hullavington Office
Duales Studium mit vertiefter Praxis: Bachelor of Science Künstliche Intelligenz und Data Science (m/w/d)
@ Gerresheimer | Wackersdorf, Germany
AI/ML Engineer (TS/SCI) {S}
@ ARKA Group, LP | Aurora, Colorado, United States
Data Integration Engineer
@ Find.co | Sliema
Data Engineer
@ Q2 | Bengaluru, India