CVDLIRJun 12, 2025

Sheet Music Benchmark: Standardized Optical Music Recognition Evaluation

arXiv:2506.10488v25 citationsh-index: 4ISMIR
Originality Synthesis-oriented
AI Analysis

This work addresses a long-standing gap in OMR evaluation for researchers and end-users, though it is incremental as it builds on existing metrics like Symbol Error Rate.

The authors tackled the lack of standardized evaluation in Optical Music Recognition by introducing the Sheet Music Benchmark dataset and the OMR Normalized Edit Distance metric, resulting in a new benchmark with 685 pages and a fine-grained error analysis tool.

In this work, we introduce the Sheet Music Benchmark (SMB), a dataset of six hundred and eighty-five pages specifically designed to benchmark Optical Music Recognition (OMR) research. SMB encompasses a diverse array of musical textures, including monophony, pianoform, quartet, and others, all encoded in Common Western Modern Notation using the Humdrum **kern format. Alongside SMB, we introduce the OMR Normalized Edit Distance (OMR-NED), a new metric tailored explicitly for evaluating OMR performance. OMR-NED builds upon the widely-used Symbol Error Rate (SER), offering a fine-grained and detailed error analysis that covers individual musical elements such as note heads, beams, pitches, accidentals, and other critical notation features. The resulting numeric score provided by OMR-NED facilitates clear comparisons, enabling researchers and end-users alike to identify optimal OMR approaches. Our work thus addresses a long-standing gap in OMR evaluation, and we support our contributions with baseline experiments using standardized SMB dataset splits for training and assessing state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes