Feb. 3, 2022, 2:11 a.m. | Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov

cs.LG updates on arXiv.org arxiv.org

Singing melody extraction is an important problem in the field of music
information retrieval. Existing methods typically rely on frequency-domain
representations to estimate the sung frequencies. However, this design does not
lead to human-level performance in the perception of melody information for
both tone (pitch-class) and octave. In this paper, we propose TONet, a
plug-and-play model that improves both tone and octave perceptions by
leveraging a novel input representation and a novel network architecture.
First, we present an improved input …

arxiv music network octave

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States