June 14, 2024, 4:47 a.m. | Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

cs.LG updates on arXiv.org arxiv.org

arXiv:2309.09548v2 Announce Type: replace-cross
Abstract: Automated speech intelligibility assessment is pivotal for hearing aid (HA) development. In this paper, we present three novel methods to improve intelligibility prediction accuracy and introduce MBI-Net+, an enhanced version of MBI-Net, the top-performing system in the 1st Clarity Prediction Challenge. MBI-Net+ leverages Whisper's embeddings to create cross-domain acoustic features and includes metadata from speech signals by using a classifier that distinguishes different enhancement methods. Furthermore, MBI-Net+ integrates the hearing-aid speech perception index (HASPI) as …

abstract accuracy arxiv assessment automated challenge cs.lg cs.sd development eess.as hearing metadata novel paper pivotal prediction replace speech type whisper

Senior Data Engineer

@ Displate | Warsaw

Senior Algorithms Engineer (Image Processing)

@ KLA | USA-MI-Ann Arbor-KLA

Principal Software Development Engineer

@ Yahoo | US - United States of America

Data Domain Architect, Vice President

@ JPMorgan Chase & Co. | Columbus, OH, United States

Senior, Data Scientist, Sam's Personalization

@ Cox Enterprises | (USA) TX MCKINNEY 04906 SAM'S CLUB

Software Engineering Specialist

@ GE HealthCare | Bengaluru HEALTHCARE (JFWTC) IN