LGMLJun 6, 2019

Guided learning for weakly-labeled semi-supervised sound event detection

arXiv:1906.02517v56 citations
Originality Incremental advance
AI Analysis

This addresses sound event detection for audio analysis applications, but it is incremental as it builds on existing weakly-labeled methods.

The paper tackles weakly-labeled semi-supervised sound event detection by proposing Guided Learning, which uses a teacher model for audio tagging to guide a student model for boundary detection, achieving competitive performance on the DCASE2018 Task4 dataset.

We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of designing a single model by considering a trade-off between the two sub-targets, we design a teacher model aiming at audio tagging to guide a student model aiming at boundary detection to learn using the unlabeled data. The guidance is guaranteed by the audio tagging performance gap of the two models. In the meantime, the student model liberated from the trade-off is able to provide more excellent boundary detection results. We propose a principle to design such two models based on the relation between the temporal compression scale and the two sub-targets. We also propose an end-to-end semi-supervised learning process for these two models to enable their abilities to rise alternately. Experiments on the DCASE2018 Task4 dataset show that our approach achieves competitive performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes