SDLGASFeb 11, 2020

Learning with Out-of-Distribution Data for Audio Classification

arXiv:2002.04683v119 citations
AI Analysis

This addresses labeling errors in audio classification, but it is incremental as it builds on existing noise-robust techniques.

The paper tackles the problem of audio classification when training data contains mislabeled out-of-distribution instances, showing that detecting and relabeling these instances improves convolutional neural network performance significantly on the FSDnoisy18k dataset.

In supervised machine learning, the assumption that training data is labelled correctly is not always satisfied. In this paper, we investigate an instance of labelling error for classification tasks in which the dataset is corrupted with out-of-distribution (OOD) instances: data that does not belong to any of the target classes, but is labelled as such. We show that detecting and relabelling certain OOD instances, rather than discarding them, can have a positive effect on learning. The proposed method uses an auxiliary classifier, trained on data that is known to be in-distribution, for detection and relabelling. The amount of data required for this is shown to be small. Experiments are carried out on the FSDnoisy18k audio dataset, where OOD instances are very prevalent. The proposed method is shown to improve the performance of convolutional neural networks by a significant margin. Comparisons with other noise-robust techniques are similarly encouraging.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes