all AI news
Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems. (arXiv:2205.15060v4 [cs.CL] UPDATED)
June 15, 2022, 1:12 a.m. | Ting-En Lin, Yuchuan Wu, Fei Huang, Luo Si, Jian Sun, Yongbin Li
cs.CL updates on arXiv.org arxiv.org
In this paper, we present Duplex Conversation, a multi-turn, multimodal
spoken dialogue system that enables telephone-based agents to interact with
customers like a human. We use the concept of full-duplex in telecommunication
to demonstrate what a human-like interactive experience should be and how to
achieve smooth turn-taking through three subtasks: user state detection,
backchannel selection, and barge-in detection. Besides, we propose
semi-supervised learning with multimodal data augmentation to leverage
unlabeled data to increase model generalization. Experimental results on three
sub-tasks …
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Director, Global Procurement Data Analytics
@ Alcon | Fort Worth - Main
Backend Software Engineer, Airbnb for Real Estate
@ Airbnb | United States
Data Scientist
@ Exoticca | Barcelona, Catalonia, Spain - Remote
ESG Data Analytics Summer Associate (Intern)
@ Apex Clean Energy | Charlottesville, VA, United States
Team Lead, Machine Learning
@ Prenuvo | Vancouver, British Columbia, Canada