all AI news
A neural prosody encoder for end-ro-end dialogue act classification. (arXiv:2205.05590v1 [cs.CL])
Web: http://arxiv.org/abs/2205.05590
May 12, 2022, 1:11 a.m. | Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Muller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo
cs.CL updates on arXiv.org arxiv.org
Dialogue act classification (DAC) is a critical task for spoken language
understanding in dialogue systems. Prosodic features such as energy and pitch
have been shown to be useful for DAC. Despite their importance, little research
has explored neural approaches to integrate prosodic features into end-to-end
(E2E) DAC models which infer dialogue acts directly from audio signals. In this
work, we propose an E2E neural architecture that takes into account the need
for characterizing prosodic phenomena co-occurring at different levels inside …
More from arxiv.org / cs.CL updates on arXiv.org
Latest AI/ML/Big Data Jobs
Director, Applied Mathematics & Computational Research Division
@ Lawrence Berkeley National Lab | Berkeley, Ca
Business Data Analyst
@ MainStreet Family Care | Birmingham, AL
Assistant/Associate Professor of the Practice in Business Analytics
@ Georgetown University McDonough School of Business | Washington DC
Senior Data Science Writer
@ NannyML | Remote
Director of AI/ML Engineering
@ Armis Industries | Remote (US only), St. Louis, California
Digital Analytics Manager
@ Patagonia | Ventura, California