LGCVMLApr 22, 2017

Robust, Deep and Inductive Anomaly Detection

arXiv:1704.06743v3171 citations
Originality Incremental advance
AI Analysis

This addresses the problem of robust and inductive anomaly detection for data analysts, though it is incremental as it builds on existing robust PCA and autoencoder methods.

The paper tackles the limitations of PCA for anomaly detection by introducing a robust autoencoder that learns a nonlinear subspace while handling corrupted data, achieving effectiveness across real-world datasets.

PCA is a classical statistical technique whose simplicity and maturity has seen it find widespread use as an anomaly detection technique. However, it is limited in this regard by being sensitive to gross perturbations of the input, and by seeking a linear subspace that captures normal behaviour. The first issue has been dealt with by robust PCA, a variant of PCA that explicitly allows for some data points to be arbitrarily corrupted, however, this does not resolve the second issue, and indeed introduces the new issue that one can no longer inductively find anomalies on a test set. This paper addresses both issues in a single model, the robust autoencoder. This method learns a nonlinear subspace that captures the majority of data points, while allowing for some data to have arbitrary corruption. The model is simple to train and leverages recent advances in the optimisation of deep neural networks. Experiments on a range of real-world datasets highlight the model's effectiveness.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes