SDLGASNov 15, 2022

SSM-Net: feature learning for Music Structure Analysis using a Self-Similarity-Matrix based loss

arXiv:2211.08141v12 citationsh-index: 34
Originality Incremental advance
AI Analysis

This addresses feature learning for music structure analysis, but it is incremental as it applies a new training paradigm to an existing task.

The paper tackled the problem of learning audio features for Music Structure Analysis by training a deep encoder to approximate a ground-truth Self-Similarity-Matrix, achieving an Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes