LG CVMar 20, 2024

Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection

Xincheng Yao, Ruoqi Li, Zefeng Qian, Lu Wang, Chongyang Zhang

arXiv:2403.13349v222.044 citationsh-index: 7Has CodeECCV

Originality Incremental advance

AI Analysis

This work addresses a challenging problem in anomaly detection for applications requiring unified models across multiple classes, representing an incremental improvement over existing methods.

The paper tackles the problem of unified anomaly detection across multiple classes by addressing the 'homogeneous mapping' issue in normalizing flow methods, where similar latent representations for normal and abnormal features lead to high anomaly missing rates. The proposed HGAD method, using hierarchical Gaussian mixture modeling and mutual information maximization, significantly improves performance over previous NF-based and state-of-the-art unified AD methods on four real-world benchmarks.

Unified anomaly detection (AD) is one of the most challenges for anomaly detection, where one unified model is trained with normal samples from multiple classes with the objective to detect anomalies in these classes. For such a challenging task, popular normalizing flow (NF) based AD methods may fall into a "homogeneous mapping" issue,where the NF-based AD models are biased to generate similar latent representations for both normal and abnormal features, and thereby lead to a high missing rate of anomalies. In this paper, we propose a novel Hierarchical Gaussian mixture normalizing flow modeling method for accomplishing unified Anomaly Detection, which we call HGAD. Our HGAD consists of two key components: inter-class Gaussian mixture modeling and intra-class mixed class centers learning. Compared to the previous NF-based AD methods, the hierarchical Gaussian mixture modeling approach can bring stronger representation capability to the latent space of normalizing flows, so that even complex multi-class distribution can be well represented and learned in the latent space. In this way, we can avoid mapping different class distributions into the same single Gaussian prior, thus effectively avoiding or mitigating the "homogeneous mapping" issue. We further indicate that the more distinguishable different class centers, the more conducive to avoiding the bias issue. Thus, we further propose a mutual information maximization loss for better structuring the latent feature space. We evaluate our method on four real-world AD benchmarks, where we can significantly improve the previous NF-based AD methods and also outperform the SOTA unified AD methods.

View on arXiv PDF Code

Similar