CV LGMay 19, 2021

Do We Really Need to Learn Representations from In-domain Data for Outlier Detection?

arXiv:2105.09270v111.121 citationsh-index: 30

Originality Incremental advance

AI Analysis

This work addresses the cost of training distinct representations for each outlier detection task, potentially reducing computational overhead for practitioners.

The paper investigates whether pre-trained networks can replace task-specific representation learning for unsupervised outlier detection, achieving competitive or better performance on various benchmarks.

Unsupervised outlier detection, which predicts if a test sample is an outlier or not using only the information from unlabelled inlier data, is an important but challenging task. Recently, methods based on the two-stage framework achieve state-of-the-art performance on this task. The framework leverages self-supervised representation learning algorithms to train a feature extractor on inlier data, and applies a simple outlier detector in the feature space. In this paper, we explore the possibility of avoiding the high cost of training a distinct representation for each outlier detection task, and instead using a single pre-trained network as the universal feature extractor regardless of the source of in-domain data. In particular, we replace the task-specific feature extractor by one network pre-trained on ImageNet with a self-supervised loss. In experiments, we demonstrate competitive or better performance on a variety of outlier detection benchmarks compared with previous two-stage methods, suggesting that learning representations from in-domain data may be unnecessary for outlier detection.

View on arXiv PDF

Similar