ASIRLGSDSep 12, 2019

Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation

arXiv:1909.05746v48 citations
Originality Highly original
AI Analysis

This work addresses music source separation for audio processing applications, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackled music source separation by proposing Sams-Net, a sliced attention-based neural network in the spectrogram domain, which outperformed most state-of-the-art DNN-based methods on the MUSDB18 dataset with fewer parameters.

Convolutional Neural Network (CNN) or Long short-term memory (LSTM) based models with the input of spectrogram or waveforms are commonly used for deep learning based audio source separation. In this paper, we propose a Sliced Attention-based neural network (Sams-Net) in the spectrogram domain for the music source separation task. It enables spectral feature interactions with multi-head attention mechanism, achieves easier parallel computing and has a larger receptive field compared with LSTMs and CNNs respectively. Experimental results on the MUSDB18 dataset show that the proposed method, with fewer parameters, outperforms most of the state-of-the-art DNN-based methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes