March 1, 2024, 5:49 a.m. | Jeehyun Lee, Yerin Choi, Tae-Jin Song, Myoung-Wan Koo

cs.CL updates on arXiv.org

arXiv:2402.18923v1 Announce Type: new
Abstract: Dysarthria, a common issue among stroke patients, severely impacts speech intelligibility. Inappropriate pauses are crucial indicators in severity assessment and speech-language therapy. We propose to extend a large-scale speech recognition model for inappropriate pause detection in dysarthric speech. To this end, we propose a task design, a labeling strategy, and a speech recognition model with an inappropriate pause prediction layer. First, we treat pause detection as speech recognition, using an automatic speech recognition (ASR) model to convert …

