Aug. 26, 2022, 1:10 a.m. | Ilaria Manco, Emmanouil Benetos, Elio Quinton, György Fazekas

cs.LG updates on arXiv.org arxiv.org

As one of the most intuitive interfaces known to humans, natural language has
the potential to mediate many tasks that involve human-computer interaction,
especially in application-focused fields like Music Information Retrieval. In
this work, we explore cross-modal learning in an attempt to bridge audio and
language in the music domain. To this end, we propose MusCALL, a framework for
Music Contrastive Audio-Language Learning. Our approach consists of a
dual-encoder architecture that learns the alignment between pairs of music
audio and …

arxiv audio language learning music

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analytics & Insight Specialist, Customer Success

@ Fortinet | Ottawa, ON, Canada

Account Director, ChatGPT Enterprise - Majors

@ OpenAI | Remote - Paris