SDLGASMLMay 12, 2018

Extended pipeline for content-based feature engineering in music genre recognition

arXiv:1805.05324v15 citations
Originality Incremental advance
AI Analysis

This work addresses music genre identification, an incremental improvement for audio analysis applications.

The authors tackled music genre recognition by extending a traditional feature engineering pipeline with additional selection and extraction phases, resulting in a noticeable performance improvement on the GTZAN dataset.

We present a feature engineering pipeline for the construction of musical signal characteristics, to be used for the design of a supervised model for musical genre identification. The key idea is to extend the traditional two-step process of extraction and classification with additive stand-alone phases which are no longer organized in a waterfall scheme. The whole system is realized by traversing backtrack arrows and cycles between various stages. In order to give a compact and effective representation of the features, the standard early temporal integration is combined with other selection and extraction phases: on the one hand, the selection of the most meaningful characteristics based on information gain, and on the other hand, the inclusion of the nonlinear correlation between this subset of features, determined by an autoencoder. The results of the experiments conducted on GTZAN dataset reveal a noticeable contribution of this methodology towards the model's performance in classification task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes