Deploying self-supervised learning in the wild for hybrid automatic speech recognition. (arXiv:2205.08598v1 [cs.SD]) | allainews.com

May 19, 2022, 1:10 a.m. | Mostafa Karimi, Changliang Liu, Kenichi Kumatani, Yao Qian, Tianyu Wu, Jian Wu

cs.CL updates on arXiv.org

Self-supervised learning (SSL) methods have proven to be very successful in
automatic speech recognition (ASR). These improvements have mostly been
reported on highly curated datasets such as LibriSpeech for non-streaming
end-to-end ASR models. However, the pivotal characteristic of SSL is that it
can be utilized for any untranscribed audio data. In this paper, we provide a
full exploration of how to utilize uncurated audio data in SSL, from data
pre-processing to deploying a streaming hybrid ASR model. More specifically,
we …

