Feb. 8, 2024, 5:44 a.m. | Yizhi Li Ruibin Yuan Ge Zhang Yinghao Ma Xingran Chen Hanzhi Yin Chenghao Xiao Chenghua Lin

cs.LG updates on arXiv.org arxiv.org

Self-supervised learning (SSL) has recently emerged as a promising paradigm for training generalisable models on large-scale data in the fields of vision, text, and speech. Although SSL has been proven effective in speech and audio, its application to music audio has yet to be thoroughly explored. This is partially due to the distinctive challenges associated with modelling musical knowledge, particularly tonal and pitched characteristics of music. To address this research gap, we propose an acoustic Music undERstanding model with large-scale …

application audio cs.ai cs.cl cs.lg cs.sd data eess.as fields music paradigm scale self-supervised learning speech ssl supervised learning supervised training text training understanding vision

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US