SD LG AS MLSep 2, 2018

Multitask Learning for Fundamental Frequency Estimation in Music

Rachel M. Bittner, Brian McFee, Juan P. Bello

arXiv:1809.00381v112.620 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the challenge of separate estimation tasks in music analysis, offering a unified approach that could benefit audio processing applications, though it appears incremental as it builds on existing learning-based methods.

The paper tackles the problem of fundamental frequency estimation in polyphonic music by proposing a multitask deep learning architecture that jointly estimates multiple-f0, melody, vocal, and bass line outputs, and shows it outperforms single-task models and is competitive with strong baselines.

Fundamental frequency (f0) estimation from polyphonic music includes the tasks of multiple-f0, melody, vocal, and bass line estimation. Historically these problems have been approached separately, and only recently, using learning-based approaches. We present a multitask deep learning architecture that jointly estimates outputs for various tasks including multiple-f0, melody, vocal and bass line estimation, and is trained using a large, semi-automatically annotated dataset. We show that the multitask model outperforms its single-task counterparts, and explore the effect of various design decisions in our approach, and show that it performs better or at least competitively when compared against strong baseline methods.

View on arXiv PDF Code

Similar