SDLGASJun 24, 2022

Domain Generalization with Relaxed Instance Frequency-wise Normalization for Multi-device Acoustic Scene Classification

arXiv:2206.12513v135 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work addresses domain shift issues in acoustic scene classification for audio processing applications, offering a plug-and-play solution that is incremental but effective for specific tasks.

The paper tackled domain generalization in multi-device acoustic scene classification by introducing Relaxed Instance Frequency-wise Normalization (RFN), which normalizes audio features along the frequency axis to reduce domain discrepancies. The method won the DCASE2021 challenge with a clear margin, demonstrating improved robustness across multiple audio devices.

While using two-dimensional convolutional neural networks (2D-CNNs) in image processing, it is possible to manipulate domain information using channel statistics, and instance normalization has been a promising way to get domain-invariant features. However, unlike image processing, we analyze that domain-relevant information in an audio feature is dominant in frequency statistics rather than channel statistics. Motivated by our analysis, we introduce Relaxed Instance Frequency-wise Normalization (RFN): a plug-and-play, explicit normalization module along the frequency axis which can eliminate instance-specific domain discrepancy in an audio feature while relaxing undesirable loss of useful discriminative information. Empirically, simply adding RFN to networks shows clear margins compared to previous domain generalization approaches on acoustic scene classification and yields improved robustness for multiple audio devices. Especially, the proposed RFN won the DCASE2021 challenge TASK1A, low-complexity acoustic scene classification with multiple devices, with a clear margin, and RFN is an extended work of our technical report.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes