MLSDJul 26, 2017

Context-Independent Polyphonic Piano Onset Transcription with an Infinite Training Dataset

arXiv:1707.08438v11 citations
Originality Incremental advance
AI Analysis

This addresses the issue of limited and costly training data for piano transcription systems, enabling more robust models across varied recording conditions.

The paper tackles the problem of polyphonic piano note onset transcription by synthesizing arbitrary quantities of training data to overcome dataset limitations, achieving good performance on the MAPS dataset and excellent generalization.

Many of the recent approaches to polyphonic piano note onset transcription require training a machine learning model on a large piano database. However, such approaches are limited by dataset availability; additional training data is difficult to produce, and proposed systems often perform poorly on novel recording conditions. We propose a method to quickly synthesize arbitrary quantities of training data, avoiding the need for curating large datasets. Various aspects of piano note dynamics - including nonlinearity of note signatures with velocity, different articulations, temporal clustering of onsets, and nonlinear note partial interference - are modeled to match the characteristics of real pianos. Our method also avoids the disentanglement problem, a recently noted issue affecting machine-learning based approaches. We train a feed-forward neural network with two hidden layers on our generated training data and achieve both good transcription performance on the large MAPS piano dataset and excellent generalization qualities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes