all AI news
Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
June 14, 2024, 4:47 a.m. | Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao
cs.LG updates on arXiv.org arxiv.org
Abstract: Automated speech intelligibility assessment is pivotal for hearing aid (HA) development. In this paper, we present three novel methods to improve intelligibility prediction accuracy and introduce MBI-Net+, an enhanced version of MBI-Net, the top-performing system in the 1st Clarity Prediction Challenge. MBI-Net+ leverages Whisper's embeddings to create cross-domain acoustic features and includes metadata from speech signals by using a classifier that distinguishes different enhancement methods. Furthermore, MBI-Net+ integrates the hearing-aid speech perception index (HASPI) as …
abstract accuracy arxiv assessment automated challenge cs.lg cs.sd development eess.as hearing metadata novel paper pivotal prediction replace speech type whisper
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Data Engineer
@ Displate | Warsaw
Senior Algorithms Engineer (Image Processing)
@ KLA | USA-MI-Ann Arbor-KLA
Principal Software Development Engineer
@ Yahoo | US - United States of America
Data Domain Architect, Vice President
@ JPMorgan Chase & Co. | Columbus, OH, United States
Senior, Data Scientist, Sam's Personalization
@ Cox Enterprises | (USA) TX MCKINNEY 04906 SAM'S CLUB
Software Engineering Specialist
@ GE HealthCare | Bengaluru HEALTHCARE (JFWTC) IN