CVFeb 3

Interpretable Logical Anomaly Classification via Constraint Decomposition and Instruction Fine-Tuning

arXiv:2602.03530v1
Originality Incremental advance
AI Analysis

This work addresses the need for interpretable anomaly classification in industrial quality assurance, offering a fine-grained solution beyond binary detection, though it is incremental in building on existing vision-language models.

The paper tackles the problem of logical anomaly classification in industrial images by introducing LogiCls, a vision-language framework that decomposes constraints into subqueries and uses instruction fine-tuning, achieving robust and interpretable results with concrete performance gains in experiments.

Logical anomalies are violations of predefined constraints on object quantity, spatial layout, and compositional relationships in industrial images. While prior work largely treats anomaly detection as a binary decision, such formulations cannot indicate which logical rule is broken and therefore offer limited value for quality assurance. We introduce Logical Anomaly Classification (LAC), a task that unifies anomaly detection and fine-grained violation classification in a single inference step. To tackle LAC, we propose LogiCls, a vision-language framework that decomposes complex logical constraints into a sequence of verifiable subqueries. We further present a data-centric instruction synthesis pipeline that generates chain-of-thought (CoT) supervision for these subqueries, coupling precise grounding annotations with diverse image-text augmentations to adapt vision language models (VLMs) to logic-sensitive reasoning. Training is stabilized by a difficulty-aware resampling strategy that emphasizes challenging subqueries and long tail constraint types. Extensive experiments demonstrate that LogiCls delivers robust, interpretable, and accurate industrial logical anomaly classification, providing both the predicted violation categories and their evidence trails.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes