CVMar 25, 2023

EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies

arXiv:2303.14535v3331 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the need for efficient and accurate anomaly detection in real-time computer vision applications, such as industrial settings, though it is incremental in improving existing student-teacher approaches.

The paper tackles the problem of real-time visual anomaly detection by proposing EfficientAD, a lightweight method that processes images in under a millisecond on a GPU, achieving low error rates and enabling fast anomaly handling with a latency of two milliseconds and throughput of six hundred images per second.

Detecting anomalies in images is an important task, especially in real-time computer vision applications. In this work, we focus on computational efficiency and propose a lightweight feature extractor that processes an image in less than a millisecond on a modern GPU. We then use a student-teacher approach to detect anomalous features. We train a student network to predict the extracted features of normal, i.e., anomaly-free training images. The detection of anomalies at test time is enabled by the student failing to predict their features. We propose a training loss that hinders the student from imitating the teacher feature extractor beyond the normal images. It allows us to drastically reduce the computational cost of the student-teacher model, while improving the detection of anomalous features. We furthermore address the detection of challenging logical anomalies that involve invalid combinations of normal local features, for example, a wrong ordering of objects. We detect these anomalies by efficiently incorporating an autoencoder that analyzes images globally. We evaluate our method, called EfficientAD, on 32 datasets from three industrial anomaly detection dataset collections. EfficientAD sets new standards for both the detection and the localization of anomalies. At a latency of two milliseconds and a throughput of six hundred images per second, it enables a fast handling of anomalies. Together with its low error rate, this makes it an economical solution for real-world applications and a fruitful basis for future research.

Code Implementations33 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes