ASLGSDSep 11, 2019

Guided Learning Convolution System for DCASE 2019 Task 4

arXiv:1909.06178v159 citations
Originality Synthesis-oriented
AI Analysis

This work addresses sound event detection for domestic environments, representing an incremental improvement in a specific domain challenge.

The paper tackles sound event detection in domestic environments using a convolutional neural network with attention pooling and semi-supervised learning, achieving an event-based F-measure of 45.43% on validation and 42.7% on test sets, which was the best performance in the DCASE2019 challenge.

In this paper, we describe in detail the system we submitted to DCASE2019 task 4: sound event detection (SED) in domestic environments. We employ a convolutional neural network (CNN) with an embedding-level attention pooling module to solve it. By considering the interference caused by the co-occurrence of multiple events in the unbalanced dataset, we utilize the disentangled feature to raise the performance of the model. To take advantage of the unlabeled data, we adopt Guided Learning for semi-supervised learning. A group of median filters with adaptive window sizes is utilized in the post-processing of output probabilities of the model. We also analyze the effect of the synthetic data on the performance of the model and finally achieve an event-based F-measure of 45.43% on the validation set and an event-based F-measure of 42.7% on the test set. The system we submitted to the challenge achieves the best performance compared to those of other participates.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes