SDASMar 23

Structural and Statistical Audio Texture Knowledge Distillation for Acoustic Classification

arXiv:2501.0192118.1h-index: 3
Predicted impact top 83% in SD · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the need for better acoustic classification in complex environments, though it is incremental as it builds on existing knowledge distillation methods.

The paper tackled the problem of environmental sound classification by proposing the SSATKD framework to incorporate low-level audio texture features, achieving consistent accuracy improvements across four diverse datasets including passive sonar and general environmental sounds.

While knowledge distillation has shown success in various audio tasks, its application to environmental sound classification often overlooks essential low-level audio texture features needed to capture local patterns in complex acoustic environments. To address this gap, the Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) framework is proposed, which combines high-level contextual information with low-level structural and statistical audio textures extracted from intermediate layers. To evaluate its generalizability across diverse acoustic domains, SSATKD is tested on four datasets within the environmental sound classification domain, including two passive sonar datasets (DeepShip and Vessel Type Underwater Acoustic Data (VTUAD)) and two general environmental sound datasets (Environmental Sound Classification 50 (ESC-50) and Tampere University of Technology (TUT) Acoustic Scenes). Two teacher adaptation strategies are explored: classifier-head-only adaptation and full fine-tuning. The framework is further evaluated using various convolutional and transformer-based teacher models. Experimental results demonstrate consistent accuracy improvements across all datasets and settings, confirming the effectiveness and robustness of SSATKD in real-world sound classification tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes