SD LG ASNov 15, 2022

SSM-Net: feature learning for Music Structure Analysis using a Self-Similarity-Matrix based loss

arXiv:2211.08141v15.72 citationsh-index: 34

Originality Incremental advance

AI Analysis

This addresses feature learning for music structure analysis, but it is incremental as it applies a new training paradigm to an existing task.

The paper tackled the problem of learning audio features for Music Structure Analysis by training a deep encoder to approximate a ground-truth Self-Similarity-Matrix, achieving an Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

View on arXiv PDF

Similar