CV · Apr 7

SGANet: Semantic and Geometric Alignment for Multimodal Multi-view Anomaly Detection

arXiv:2604.05632 · 40.9 · h-index: 4
Predicted impact top 78% in CV · last 90 days · Originality: Incremental advance
AI Analysis

This work addresses anomaly detection in industrial scenarios with multimodal multi-view data, representing an incremental improvement over existing unsupervised methods.

The paper tackles multi-view anomaly detection for surface defects on complex objects. It proposes SGANet, a framework that combines semantic and geometric alignment to address feature inconsistency caused by viewpoint variations and modality discrepancies, and reports state-of-the-art performance on the SiM3D and Eyecandies datasets.

Multi-view anomaly detection aims to identify surface defects on complex objects using observations captured from multiple viewpoints. However, existing unsupervised methods often suffer from feature inconsistency arising from viewpoint variations and modality discrepancies. To address these challenges, we propose a Semantic and Geometric Alignment Network (SGANet), a unified framework for multimodal multi-view anomaly detection that effectively combines semantic and geometric alignment to learn physically coherent feature representations across viewpoints and modalities. SGANet consists of three key components. The Selective Cross-view Feature Refinement Module (SCFRM) selectively aggregates informative patch features from adjacent views to enhance cross-view feature interaction. The Semantic-Structural Patch Alignment (SSPA) enforces semantic alignment across modalities while maintaining structural consistency under viewpoint transformations. The Multi-View Geometric Alignment (MVGA) further aligns geometrically corresponding patches across viewpoints. By jointly modeling feature interaction, semantic and structural consistency, and global geometric correspondence, SGANet effectively enhances anomaly detection performance in multimodal multi-view settings. Extensive experiments on the SiM3D and Eyecandies datasets demonstrate that SGANet achieves state-of-the-art performance in both anomaly detection and localization, validating its effectiveness in realistic industrial scenarios.
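
To make the idea of selectively aggregating patch features from adjacent views concrete, here is a minimal NumPy sketch. It is an illustration only, not the paper's SCFRM: the abstract does not specify the selection rule, so the top-k cosine-similarity selection, the softmax weighting, the equal-weight residual blend, and the names `refine_with_adjacent_views`, `k`, and `temperature` are all assumptions made for this example.

```python
import numpy as np


def refine_with_adjacent_views(ref, neighbors, k=3, temperature=0.1):
    """Blend each reference-view patch with its k most similar neighbor patches.

    ref:       (P, D) patch features of the reference view.
    neighbors: (M, D) patch features pooled from adjacent views.
    NOTE: hypothetical selection rule (top-k cosine similarity), assumed
    for illustration; the source does not describe the actual mechanism.
    """
    eps = 1e-8
    # L2-normalise so that the dot product equals cosine similarity.
    ref_n = ref / (np.linalg.norm(ref, axis=1, keepdims=True) + eps)
    nb_n = neighbors / (np.linalg.norm(neighbors, axis=1, keepdims=True) + eps)
    sim = ref_n @ nb_n.T  # (P, M)

    # Keep only the k most similar neighbor patches per reference patch.
    k = max(1, min(k, neighbors.shape[0]))
    topk = np.argpartition(-sim, k - 1, axis=1)[:, :k]  # (P, k)
    topk_sim = np.take_along_axis(sim, topk, axis=1)    # (P, k)

    # Softmax over the selected similarities (numerically stabilised).
    w = np.exp((topk_sim - topk_sim.max(axis=1, keepdims=True)) / temperature)
    w /= w.sum(axis=1, keepdims=True)

    # Weighted sum of the selected neighbor features.
    aggregated = (w[..., None] * neighbors[topk]).sum(axis=1)  # (P, D)

    # Residual-style blend; the 0.5 weight is an arbitrary choice.
    return 0.5 * (ref + aggregated)


rng = np.random.default_rng(0)
ref_feats = rng.normal(size=(4, 8))
adjacent_feats = rng.normal(size=(10, 8))
refined = refine_with_adjacent_views(ref_feats, adjacent_feats, k=3)
print(refined.shape)
```

Restricting the aggregation to the top-k matches is one simple way to keep uninformative or occluded patches from diluting the refined feature; a learned gating or attention mechanism would be another.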
