SDLGASMLSep 2, 2018

Multitask Learning for Fundamental Frequency Estimation in Music

arXiv:1809.00381v120 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of separate estimation tasks in music analysis, offering a unified approach that could benefit audio processing applications, though it appears incremental as it builds on existing learning-based methods.

The paper tackles the problem of fundamental frequency estimation in polyphonic music by proposing a multitask deep learning architecture that jointly estimates multiple-f0, melody, vocal, and bass line outputs, and shows it outperforms single-task models and is competitive with strong baselines.

Fundamental frequency (f0) estimation from polyphonic music includes the tasks of multiple-f0, melody, vocal, and bass line estimation. Historically these problems have been approached separately, and only recently, using learning-based approaches. We present a multitask deep learning architecture that jointly estimates outputs for various tasks including multiple-f0, melody, vocal and bass line estimation, and is trained using a large, semi-automatically annotated dataset. We show that the multitask model outperforms its single-task counterparts, and explore the effect of various design decisions in our approach, and show that it performs better or at least competitively when compared against strong baseline methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes