all AI news
SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data
Feb. 13, 2024, 5:48 a.m. | Hsuan-Fu Wang Yi-Jen Shih Heng-Jui Chang Layne Berry Puyuan Peng Hung-yi Lee Hsin-Min Wang Dav
cs.CL updates on arXiv.org arxiv.org
apply clip continuous cs.cl cs.sd data eess.as extensions fire framework image image data images paper representation representation learning speech text text transcription through transcription via
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne