CVMar 14

VID-AD: A Dataset for Image-Level Logical Anomaly Detection under Vision-Induced Distraction

arXiv:2603.1396449.5h-index: 8Has Code
Predicted impact top 70% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the problem of robust anomaly detection for industrial quality control, though it is incremental as it builds on existing methods with a new dataset and framework.

The paper tackles the challenge of logical anomaly detection in industrial inspection by introducing VID-AD, a dataset with controlled variations in visual distractions, and proposes a language-based framework that improves detection performance over baselines.

Logical anomaly detection in industrial inspection remains challenging due to variations in visual appearance (e.g., background clutter, illumination shift, and blur), which often distract vision-centric detectors from identifying rule-level violations. However, existing benchmarks rarely provide controlled settings where logical states are fixed while such nuisance factors vary. To address this gap, we introduce VID-AD, a dataset for logical anomaly detection under vision-induced distraction. It comprises 10 manufacturing scenarios and five capture conditions, totaling 50 one-class tasks and 10,395 images. Each scenario is defined by two logical constraints selected from quantity, length, type, placement, and relation, with anomalies including both single-constraint and combined violations. We further propose a language-based anomaly detection framework that relies solely on text descriptions generated from normal images. Using contrastive learning with positive texts and contradiction-based negative texts synthesized from these descriptions, our method learns embeddings that capture logical attributes rather than low-level features. Extensive experiments demonstrate consistent improvements over baselines across the evaluated settings. The dataset is available at: https://github.com/nkthiroto/VID-AD.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes