ASAILGSDJul 23, 2022

Low-complexity CNNs for Acoustic Scene Classification

arXiv:2207.11529v14 citationsh-index: 66
Originality Incremental advance
AI Analysis

This work addresses deployment challenges for acoustic scene classification on resource-constrained devices, representing an incremental improvement in efficiency.

The paper tackled the problem of high computational complexity and memory requirements in CNNs for acoustic scene classification by proposing a low-complexity framework, resulting in an ensemble with about 60K parameters, 19M operations, and a 2-4 percentage point performance improvement over the baseline.

This paper presents a low-complexity framework for acoustic scene classification (ASC). Most of the frameworks designed for ASC use convolutional neural networks (CNNs) due to their learning ability and improved performance compared to hand-engineered features. However, CNNs are resource hungry due to their large size and high computational complexity. Therefore, CNNs are difficult to deploy on resource constrained devices. This paper addresses the problem of reducing the computational complexity and memory requirement in CNNs. We propose a low-complexity CNN architecture, and apply pruning and quantization to further reduce the parameters and memory. We then propose an ensemble framework that combines various low-complexity CNNs to improve the overall performance. An experimental evaluation of the proposed framework is performed on the publicly available DCASE 2022 Task 1 that focuses on ASC. The proposed ensemble framework has approximately 60K parameters, requires 19M multiply-accumulate operations and improves the performance by approximately 2-4 percentage points compared to the DCASE 2022 Task 1 baseline network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes