CVAug 5, 2017

Detecting Noteheads in Handwritten Scores with ConvNets and Bounding Box Regression

arXiv:1708.01806v123 citations
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in music notation processing for researchers and practitioners, offering a machine learning-based solution to replace heuristic methods.

The paper tackled the problem of detecting noteheads in handwritten music scores, which is essential for Optical Music Recognition, by developing a convolutional neural network with bounding box regression that achieved a detection f-score of 0.97 on the MUSCIMA++ dataset without requiring staff removal.

Noteheads are the interface between the written score and music. Each notehead on the page signifies one note to be played, and detecting noteheads is thus an unavoidable step for Optical Music Recognition. Noteheads are clearly distinct objects, however, the variety of music notation handwriting makes noteheads harder to identify, and while handwritten music notation symbol {\em classification} is a well-studied task, symbol {\em detection} has usually been limited to heuristics and rule-based systems instead of machine learning methods better suited to deal with the uncertainties in handwriting. We present ongoing work on a simple notehead detector using convolutional neural networks for pixel classification and bounding box regression that achieves a detection f-score of 0.97 on binary score images in the MUSCIMA++ dataset, does not require staff removal, and is applicable to a variety of handwriting styles and levels of musical complexity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes