Feb. 5, 2024, 6:43 a.m. | Drew Edwards Simon Dixon Emmanouil Benetos Akira Maezawa Yuta Kusaka

cs.LG updates on arXiv.org arxiv.org

Algorithms for automatic piano transcription have improved dramatically in recent years due to new datasets and modeling techniques. Recent developments have focused primarily on adapting new neural network architectures, such as the Transformer and Perceiver, in order to yield more accurate systems. In this work, we study transcription systems from the perspective of their training data. By measuring their performance on out-of-distribution annotated piano data, we show how these models can severely overfit to acoustic properties of the training data. …

algorithms analysis architectures cs.lg cs.sd data data-driven datasets eess.as modeling network neural network perceiver perspective robust study systems transcription transformer work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne