all AI news
Cross-modal Contrastive Learning for Speech Translation. (arXiv:2205.02444v1 [cs.CL])
May 6, 2022, 1:11 a.m. | Rong Ye, Mingxuan Wang, Lei Li
cs.CL updates on arXiv.org arxiv.org
How can we learn unified representations for spoken utterances and their
written text? Learning similar representations for semantically similar speech
and text is important for speech translation. To this end, we propose ConST, a
cross-modal contrastive learning method for end-to-end speech-to-text
translation. We evaluate ConST and a variety of previous baselines on a popular
benchmark MuST-C. Experiments show that the proposed ConST consistently
outperforms the previous methods on, and achieves an average BLEU of 29.4. The
analysis further verifies that …
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Data Scientist, Senior
@ Pacific Gas and Electric Company | Oakland, CA, US, 94612
AML Reporting Data Specialist
@ Wise | Tallinn, Estonia
Bachelorarbeit im Bereich IT - "Einsatz von Generative AI im Konzernumfeld" (WiSe 24/25)
@ AGCO | Marktoberdorf, DE
Big Data Engineer
@ ACL Technology | Argentina
REF25217Q-Deputy Manager - MIS (Power BI, Dashboard, Excel) - GGN
@ WNS Global Services | Gurgaon, India