all AI news
Deploying self-supervised learning in the wild for hybrid automatic speech recognition. (arXiv:2205.08598v1 [cs.SD])
cs.CL updates on arXiv.org arxiv.org
Self-supervised learning (SSL) methods have proven to be very successful in
automatic speech recognition (ASR). These great improvements have been reported
mostly based on highly curated datasets such as LibriSpeech for non-streaming
End-to-End ASR models. However, the pivotal characteristics of SSL is to be
utilized for any untranscribed audio data. In this paper, we provide a full
exploration on how to utilize uncurated audio data in SSL from data
pre-processing to deploying an streaming hybrid ASR model. More specifically,
we …
arxiv automatic speech recognition hybrid learning self-supervised learning speech speech recognition supervised learning