MLSDMar 28, 2017

Deep scattering transform applied to note onset detection and instrument recognition

arXiv:1703.09775v11 citations
Originality Synthesis-oriented
AI Analysis

This work addresses subtasks in automatic music transcription for music information retrieval, but it is incremental as it applies an existing method to specific data.

The study tackled note onset detection and instrument recognition in automatic music transcription by applying the deep scattering transform to plucked string and piano music, showing it outperformed other sound representations on both MIDI-driven and real music datasets.

Automatic Music Transcription (AMT) is one of the oldest and most well-studied problems in the field of music information retrieval. Within this challenging research field, onset detection and instrument recognition take important places in transcription systems, as they respectively help to determine exact onset times of notes and to recognize the corresponding instrument sources. The aim of this study is to explore the usefulness of multiscale scattering operators for these two tasks on plucked string instrument and piano music. After resuming the theoretical background and illustrating the key features of this sound representation method, we evaluate its performances comparatively to other classical sound representations. Using both MIDI-driven datasets with real instrument samples and real musical pieces, scattering is proved to outperform other sound representations for these AMT subtasks, putting forward its richer sound representation and invariance properties.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes