ASLGSDJul 27, 2020

Receptive-Field Regularized CNNs for Music Classification and Tagging

arXiv:2007.13503v14 citations
Originality Incremental advance
AI Analysis

This work addresses a domain-specific bottleneck in MIR by making deep architectures competitive, offering a new baseline for music classification and tagging.

The paper tackled the problem of deep CNNs like ResNet underperforming in Music Information Retrieval (MIR) tasks due to lack of generalization, and showed that applying Receptive-Field Regularization and Shake-Shake techniques significantly improves their performance, enabling them to outperform current complex models on two MIR tasks and datasets.

Convolutional Neural Networks (CNNs) have been successfully used in various Music Information Retrieval (MIR) tasks, both as end-to-end models and as feature extractors for more complex systems. However, the MIR field is still dominated by the classical VGG-based CNN architecture variants, often in combination with more complex modules such as attention, and/or techniques such as pre-training on large datasets. Deeper models such as ResNet -- which surpassed VGG by a large margin in other domains -- are rarely used in MIR. One of the main reasons for this, as we will show, is the lack of generalization of deeper CNNs in the music domain. In this paper, we present a principled way to make deep architectures like ResNet competitive for music-related tasks, based on well-designed regularization strategies. In particular, we analyze the recently introduced Receptive-Field Regularization and Shake-Shake, and show that they significantly improve the generalization of deep CNNs on music-related tasks, and that the resulting deep CNNs can outperform current more complex models such as CNNs augmented with pre-training and attention. We demonstrate this on two different MIR tasks and two corresponding datasets, thus offering our deep regularized CNNs as a new baseline for these datasets, which can also be used as a feature-extracting module in future, more complex approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes