April 19, 2024, 4:43 a.m. | Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng

cs.LG updates on arXiv.org arxiv.org

arXiv:2304.04974v3 Announce Type: replace-cross
Abstract: Automatic speech recognition (ASR) has gained remarkable successes thanks to recent advances of deep learning, but it usually degrades significantly under real-world noisy conditions. Recent works introduce speech enhancement (SE) as front-end to improve speech quality, which is proved effective but may not be optimal for downstream ASR due to speech distortion problem. Based on that, latest works combine SE and currently popular self-supervised learning (SSL) to alleviate distortion and improve noise robustness. Despite the …

abstract advances arxiv asr automatic speech recognition cs.lg cs.sd deep learning eess.as front-end noise quality recognition restore robust speech speech recognition type via world

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York