CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis. (arXiv:2111.08191v2 [cs.CL] UPDATED) | allainews.com

June 30, 2022, 1:12 a.m. | Nianzu Zheng, Liqun Deng, Wenyong Huang, Yu Ting Yeung, Baohua Xu, Yuanyuan Guo, Yasheng Wang, Xiao Chen, Xin Jiang, Qun Liu

cs.CL updates on arXiv.org arxiv.org

Mispronunciation detection and diagnosis (MDD) is a popular research focus in
computer-aided pronunciation training (CAPT) systems. End-to-end (e2e)
approaches are becoming dominant in MDD. However an e2e MDD model usually
requires entire speech utterances as input context, which leads to significant
time latency especially for long paragraphs. We propose a streaming e2e MDD
model called CoCA-MDD. We utilize conv-transformer structure to encode input
speech in a streaming manner. A coupled cross-attention (CoCA) mechanism is
proposed to integrate frame-level acoustic features …

arxiv attention detection diagnosis framework streaming

More from arxiv.org / cs.CL updates on arXiv.org

Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval 9 hours ago | arxiv.org

abstract arxiv auto bag +17

Does GPT-4 pass the Turing test? 9 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +16

Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models 9 hours ago | arxiv.org

abstract arxiv challenges cs.cl +13

COPAL-ID: Indonesian Language Reasoning with Local Culture and Nuances 9 hours ago | arxiv.org

abstract arxiv causal common sense +11

Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation 9 hours ago | arxiv.org

abstract arxiv cross-lingual cs.cl +17

SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation 9 hours ago | arxiv.org

abstract algorithm algorithms arxiv +19

C-Pack: Packaged Resources To Advance General Chinese Embedding 9 hours ago | arxiv.org

advance arxiv chinese cs.ai +6

$\rm SP^3$: Enhancing Structured Pruning via PCA Projection 9 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +12

Matching Patients to Clinical Trials with Large Language Models 9 hours ago | arxiv.org

abstract arxiv challenge clinical +19

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Praktikum im Bereich eMobility / Charging Solutions - Data Analysis

@ Bosch Group | Stuttgart, Germany

View on ai-jobs.net

Business Data Analyst

@ PartnerRe | Toronto, ON, Canada

View on ai-jobs.net

Machine Learning/DevOps Engineer II

@ Extend | Remote, United States

View on ai-jobs.net

Business Intelligence Developer, Marketing team (Bangkok based, relocation provided)

@ Agoda | Bangkok (Central World)

View on ai-jobs.net