Jan. 31, 2024, 4:46 p.m. | Arhan Jain, Alec Bunn, TJ Tsai

cs.LG updates on arXiv.org arxiv.org

This article motivates, describes, and presents the PBSCSR dataset for
studying composer style recognition of piano sheet music. Our overarching goal
was to create a dataset for studying composer style recognition that is "as
accessible as MNIST and as challenging as ImageNet." To achieve this goal, we
sample fixed-length bootleg score fragments from piano sheet music images on
IMSLP. The dataset itself contains 40,000 62x64 bootleg score images for a
9-way classification task, 100,000 62x64 bootleg score images for a …

article arxiv cs.sd dataset imagenet mnist music recognition sample studying style

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead Data Engineer

@ WorkMoney | New York City, United States - Remote