April 19, 2024, 4:43 a.m. | Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng

cs.LG updates on arXiv.org arxiv.org

arXiv:2304.04974v3 Announce Type: replace-cross
Abstract: Automatic speech recognition (ASR) has gained remarkable successes thanks to recent advances of deep learning, but it usually degrades significantly under real-world noisy conditions. Recent works introduce speech enhancement (SE) as front-end to improve speech quality, which is proved effective but may not be optimal for downstream ASR due to speech distortion problem. Based on that, latest works combine SE and currently popular self-supervised learning (SSL) to alleviate distortion and improve noise robustness. Despite the …

abstract advances arxiv asr automatic speech recognition cs.lg cs.sd deep learning eess.as front-end noise quality recognition restore robust speech speech recognition type via world

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Data Analyst

@ Alstom | Johannesburg, GT, ZA