Learning Multi-view Anomaly Detection with Efficient Adaptive Selection
This addresses the problem of blind spots in single-view anomaly detection for applications like surveillance or medical imaging, though it is incremental as it builds on existing multi-view methods.
The paper tackles multi-view anomaly detection by introducing a Multi-View Adaptive Selection algorithm that integrates features across views, achieving state-of-the-art performance with an average improvement of +2.5 across 10 metrics on the Real-IAD dataset.
This study explores the recently proposed and challenging multi-view Anomaly Detection (AD) task. Single-view tasks will encounter blind spots from other perspectives, resulting in inaccuracies in sample-level prediction. Therefore, we introduce the Multi-View Anomaly Detection (MVAD) approach, which learns and integrates features from multi-views. Specifically, we propose a Multi-View Adaptive Selection (MVAS) algorithm for feature learning and fusion across multiple views. The feature maps are divided into neighbourhood attention windows to calculate a semantic correlation matrix between single-view windows and all other views, which is an attention mechanism conducted for each single-view window and the top-k most correlated multi-view windows. Adjusting the window sizes and top-k can minimise the complexity to O((hw)^4/3). Extensive experiments on the Real-IAD dataset under the multi-class setting validate the effectiveness of our approach, achieving state-of-the-art performance with an average improvement of +2.5 across 10 metrics at the sample/image/pixel levels, using only 18M parameters and requiring fewer FLOPs and training time. The codes are available at https://github.com/lewandofskee/MVAD.