CVLGMay 19, 2021

Do We Really Need to Learn Representations from In-domain Data for Outlier Detection?

arXiv:2105.09270v121 citations
Originality Incremental advance
AI Analysis

This work addresses the cost of training distinct representations for each outlier detection task, potentially reducing computational overhead for practitioners.

The paper investigates whether pre-trained networks can replace task-specific representation learning for unsupervised outlier detection, achieving competitive or better performance on various benchmarks.

Unsupervised outlier detection, which predicts if a test sample is an outlier or not using only the information from unlabelled inlier data, is an important but challenging task. Recently, methods based on the two-stage framework achieve state-of-the-art performance on this task. The framework leverages self-supervised representation learning algorithms to train a feature extractor on inlier data, and applies a simple outlier detector in the feature space. In this paper, we explore the possibility of avoiding the high cost of training a distinct representation for each outlier detection task, and instead using a single pre-trained network as the universal feature extractor regardless of the source of in-domain data. In particular, we replace the task-specific feature extractor by one network pre-trained on ImageNet with a self-supervised loss. In experiments, we demonstrate competitive or better performance on a variety of outlier detection benchmarks compared with previous two-stage methods, suggesting that learning representations from in-domain data may be unnecessary for outlier detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes