May 19, 2022, 1:11 a.m. | Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz

cs.LG updates on arXiv.org

We present Translatotron 2, a neural direct speech-to-speech translation
model that can be trained end-to-end. Translatotron 2 consists of a speech
encoder, a linguistic decoder, an acoustic synthesizer, and a single attention
module that connects them together. Experimental results on three datasets
consistently show that Translatotron 2 outperforms the original Translatotron
by a large margin on both translation quality (up to +15.5 BLEU) and speech
generation quality, and approaches the performance of cascade systems. In addition, we
propose a simple …
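
To make the four-component layout described in the abstract concrete, here is a toy sketch of how a speech encoder, a linguistic decoder, an acoustic synthesizer, and one shared attention module might be wired together. It is not the authors' implementation: the function names, the dot-product attention, the random weights, the hidden size, and the choice to feed the same attention context to both decoder and synthesizer are all assumptions made for illustration.

```python
import math
import random

random.seed(0)
D = 4  # toy hidden size (assumption, not from the paper)

def rand_matrix(rows, cols):
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def softmax(scores):
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, encoded):
    """One dot-product attention over encoder frames: returns (context, weights)."""
    weights = softmax([sum(q * h for q, h in zip(query, frame)) for frame in encoded])
    context = [sum(w * frame[i] for w, frame in zip(weights, encoded))
               for i in range(len(query))]
    return context, weights

# Hypothetical, randomly initialised weights standing in for trained parameters.
W_ENC = rand_matrix(D, D)
W_DEC = rand_matrix(D, D)
W_SYN = rand_matrix(2, 2 * D)  # two toy "spectrogram" bins per output step

def speech_encoder(frames):
    """Map source speech frames to hidden representations."""
    return [[math.tanh(x) for x in matvec(W_ENC, f)] for f in frames]

def translate(frames, steps=3):
    """Run the toy encoder, then decode and synthesize `steps` output frames."""
    encoded = speech_encoder(frames)
    state = [0.0] * D
    outputs = []
    for _ in range(steps):
        # The single attention module is queried once per step.
        context, _ = attend(state, encoded)
        # Linguistic decoder update (toy recurrence).
        state = [math.tanh(x + c) for x, c in zip(matvec(W_DEC, state), context)]
        # The synthesizer consumes the decoder state and the same context.
        outputs.append(matvec(W_SYN, state + context))
    return outputs
```

Calling `translate(frames, steps=3)` on a list of `D`-dimensional frames returns three toy output frames. The point of the sketch is only the data flow: one attention result per step is shared rather than recomputed by separate modules.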

