all AI news
Textless Speech-to-Speech Translation on Real Data. (arXiv:2112.08352v2 [cs.CL] UPDATED)
May 6, 2022, 1:12 a.m. | Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Pino, Jiatao Gu, Wei-Ning H
cs.LG updates on arXiv.org arxiv.org
We present a textless speech-to-speech translation (S2ST) system that can
translate speech from one language into another language and can be built
without the need of any text data. Different from existing work in the
literature, we tackle the challenge in modeling multi-speaker target speech and
train the systems with real-world S2ST data. The key to our approach is a
self-supervised unit-based speech normalization technique, which finetunes a
pre-trained speech encoder with paired audios from multiple speakers and a
single …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Director, Global Procurement Data Analytics
@ Alcon | Fort Worth - Main
Backend Software Engineer, Airbnb for Real Estate
@ Airbnb | United States
Data Scientist
@ Exoticca | Barcelona, Catalonia, Spain - Remote
ESG Data Analytics Summer Associate (Intern)
@ Apex Clean Energy | Charlottesville, VA, United States
Team Lead, Machine Learning
@ Prenuvo | Vancouver, British Columbia, Canada