LGCVAug 26, 2023

Out-of-distribution detection using normalizing flows on the data manifold

arXiv:2308.13792v21 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses the issue of unreliable out-of-distribution detection in machine learning models, particularly for safety-critical applications, but is incremental as it builds on existing methods.

The paper tackled the problem of out-of-distribution detection by proposing a criterion that combines density estimation on a low-dimensional manifold with a modified likelihood measure based on data complexity, showing improved detection ability without structural changes or auxiliary data.

Using the intuition that out-of-distribution data have lower likelihoods, a common approach for out-of-distribution detection involves estimating the underlying data distribution. Normalizing flows are likelihood-based generative models providing a tractable density estimation via dimension-preserving invertible transformations. Conventional normalizing flows are prone to fail in out-of-distribution detection, because of the well-known curse of dimensionality problem of the likelihood-based models. To solve the problem of likelihood-based models, some works try to modify likelihood for example by incorporating a data complexity measure. We observed that these modifications are still insufficient. According to the manifold hypothesis, real-world data often lie on a low-dimensional manifold. Therefore, we proceed by estimating the density on a low-dimensional manifold and calculating a distance from the manifold as a measure for out-of-distribution detection. We propose a powerful criterion that combines this measure with the modified likelihood measure based on data complexity. Extensive experimental results show that incorporating manifold learning while accounting for the estimation of data complexity improves the out-of-distribution detection ability of normalizing flows. This improvement is achieved without modifying the model structure or using auxiliary out-of-distribution data during training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes