LG MLJun 6, 2019

Guided learning for weakly-labeled semi-supervised sound event detection

Liwei Lin, Xiangdong Wang, Hong Liu, Yueliang Qian

arXiv:1906.02517v55.46 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses sound event detection for audio analysis applications, but it is incremental as it builds on existing weakly-labeled methods.

The paper tackles weakly-labeled semi-supervised sound event detection by proposing Guided Learning, which uses a teacher model for audio tagging to guide a student model for boundary detection, achieving competitive performance on the DCASE2018 Task4 dataset.

We propose a simple but efficient method termed Guided Learning for weakly-labeled semi-supervised sound event detection (SED). There are two sub-targets implied in weakly-labeled SED: audio tagging and boundary detection. Instead of designing a single model by considering a trade-off between the two sub-targets, we design a teacher model aiming at audio tagging to guide a student model aiming at boundary detection to learn using the unlabeled data. The guidance is guaranteed by the audio tagging performance gap of the two models. In the meantime, the student model liberated from the trade-off is able to provide more excellent boundary detection results. We propose a principle to design such two models based on the relation between the temporal compression scale and the two sub-targets. We also propose an end-to-end semi-supervised learning process for these two models to enable their abilities to rise alternately. Experiments on the DCASE2018 Task4 dataset show that our approach achieves competitive performance.

View on arXiv PDF Code

Similar