SD LG ASDec 14, 2024

Audio-based Anomaly Detection in Industrial Machines Using Deep One-Class Support Vector Data Description

Sertac Kilickaya, Mete Ahishali, Cansu Celebioglu, Fahad Sohrab, Levent Eren, Turker Ince, Murat Askar, Moncef Gabbouj

arXiv:2412.10792v16.76 citationsh-index: 332025 IEEE Symposium on Computational Intelligence on Engineering/Cyber Physical Systems Companion (CIES Companion)

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of cost-effective condition monitoring for industrial equipment operators, but it is incremental as it applies an existing method to a specific domain with improvements in performance and efficiency.

The study tackled anomaly detection in industrial machines using audio data, finding that a deep one-class support vector data description method with a subspace dimension of 2 outperformed a baseline autoencoder, achieving average AUC scores of 0.84, 0.80, and 0.69 across different noise levels and using 7.4 times fewer parameters.

The frequent breakdowns and malfunctions of industrial equipment have driven increasing interest in utilizing cost-effective and easy-to-deploy sensors, such as microphones, for effective condition monitoring of machinery. Microphones offer a low-cost alternative to widely used condition monitoring sensors with their high bandwidth and capability to detect subtle anomalies that other sensors might have less sensitivity. In this study, we investigate malfunctioning industrial machines to evaluate and compare anomaly detection performance across different machine types and fault conditions. Log-Mel spectrograms of machinery sound are used as input, and the performance is evaluated using the area under the curve (AUC) score for two different methods: baseline dense autoencoder (AE) and one-class deep Support Vector Data Description (deep SVDD) with different subspace dimensions. Our results over the MIMII sound dataset demonstrate that the deep SVDD method with a subspace dimension of 2 provides superior anomaly detection performance, achieving average AUC scores of 0.84, 0.80, and 0.69 for 6 dB, 0 dB, and -6 dB signal-to-noise ratios (SNRs), respectively, compared to 0.82, 0.72, and 0.64 for the baseline model. Moreover, deep SVDD requires 7.4 times fewer trainable parameters than the baseline dense AE, emphasizing its advantage in both effectiveness and computational efficiency.

View on arXiv PDF

Similar