CVApr 9

GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis

arXiv:2604.0830174.2
AI Analysis

This addresses the scarcity of anomalous data for visual inspection in industrial settings, offering a novel method for precise anomaly synthesis.

The paper tackled the problem of limited real anomalous samples in industrial quality control by proposing GroundingAnomaly, a few-shot anomaly image generation framework that uses spatial conditioning and gated self-attention to synthesize high-quality anomalies, achieving state-of-the-art performance on MVTec AD and VisA datasets.

The performance of visual anomaly inspection in industrial quality control is often constrained by the scarcity of real anomalous samples. Consequently, anomaly synthesis techniques have been developed to enlarge training sets and enhance downstream inspection. However, existing methods either suffer from poor integration caused by inpainting or fail to provide accurate masks. To address these limitations, we propose GroundingAnomaly, a novel few-shot anomaly image generation framework. Our framework introduces a Spatial Conditioning Module that leverages per-pixel semantic maps to enable precise spatial control over the synthesized anomalies. Furthermore, a Gated Self-Attention Module is designed to inject conditioning tokens into a frozen U-Net via gated attention layers. This carefully preserves pretrained priors while ensuring stable few-shot adaptation. Extensive evaluations on the MVTec AD and VisA datasets demonstrate that GroundingAnomaly generates high-quality anomalies and achieves state-of-the-art performance across multiple downstream tasks, including anomaly detection, segmentation, and instance-level detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes