all AI news
TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music. (arXiv:2202.00951v1 [eess.AS])
Feb. 3, 2022, 2:11 a.m. | Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov
cs.LG updates on arXiv.org arxiv.org
Singing melody extraction is an important problem in the field of music
information retrieval. Existing methods typically rely on frequency-domain
representations to estimate the sung frequencies. However, this design does not
lead to human-level performance in the perception of melody information for
both tone (pitch-class) and octave. In this paper, we propose TONet, a
plug-and-play model that improves both tone and octave perceptions by
leveraging a novel input representation and a novel network architecture.
First, we present an improved input …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Lead Data Modeler
@ Sherwin-Williams | Cleveland, OH, United States