CVJan 21

DevPrompt: Deviation-Based Prompt Learning for One-Normal ShotImage Anomaly Detection

arXiv:2601.15453v1

Originality Incremental advance

AI Analysis

This work addresses the problem of detecting anomalies in images with limited normal samples for industrial inspection or medical imaging, representing an incremental improvement over prior prompt-based methods.

The paper tackles the challenge of few-normal shot anomaly detection in images by proposing a deviation-guided prompt learning framework that integrates vision-language models with statistical scoring, achieving superior pixel-level detection performance on benchmarks like MVTecAD and VISA compared to existing methods.

Few-normal shot anomaly detection (FNSAD) aims to detect abnormal regions in images using only a few normal training samples, making the task highly challenging due to limited supervision and the diversity of potential defects. Recent approaches leverage vision-language models such as CLIP with prompt-based learning to align image and text features. However, existing methods often exhibit weak discriminability between normal and abnormal prompts and lack principled scoring mechanisms for patch-level anomalies. We propose a deviation-guided prompt learning framework that integrates the semantic power of vision-language models with the statistical reliability of deviation-based scoring. Specifically, we replace fixed prompt prefixes with learnable context vectors shared across normal and abnormal prompts, while anomaly-specific suffix tokens enable class-aware alignment. To enhance separability, we introduce a deviation loss with Top-K Multiple Instance Learning (MIL), modeling patch-level features as Gaussian deviations from the normal distribution. This allows the network to assign higher anomaly scores to patches with statistically significant deviations, improving localization and interpretability. Experiments on the MVTecAD and VISA benchmarks demonstrate superior pixel-level detection performance compared to PromptAD and other baselines. Ablation studies further validate the effectiveness of learnable prompts, deviation-based scoring, and the Top-K MIL strategy.

View on arXiv PDF

Similar