SD ASApr 6, 2018

Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation

Delia Fano Yela, Dan Stowell, Mark Sandler

arXiv:1804.02325v12.93 citations

Originality Incremental advance

AI Analysis

This work addresses a specific bottleneck in source separation for audio processing, offering an incremental improvement in parameter optimization.

The paper tackles the problem of optimizing the k parameter in Kernel Additive Modelling for vocal separation by introducing a graph theory-based method using k-NN hubness, achieving results that outperform common approaches with low computational cost.

Kernel Additive Modelling (KAM) is a framework for source separation aiming to explicitly model inherent properties of sound sources to help with their identification and separation. KAM separates a given source by applying robust statistics on the selection of time-frequency bins obtained through a source-specific kernel, typically the k-NN function. Even though the parameter k appears to be key for a successful separation, little discussion on its influence or optimisation can be found in the literature. Here we propose a novel method, based on graph theory statistics, to automatically optimise $k$ in a vocal separation task. We introduce the k-NN hubness as an indicator to find a tailored k at a low computational cost. Subsequently, we evaluate our method in comparison to the common approach to choose k. We further discuss the influence and importance of this parameter with illuminating results.

View on arXiv PDF

Similar