CV LG Jan 13, 2024

NODI: Out-Of-Distribution Detection with Noise from Diffusion

arXiv:2401.08689v25.24 citations h-index: 19

Originality Incremental advance

AI Analysis

This work addresses the problem of safe deployment of machine learning models by improving OOD detection robustness across different image encoders, though it appears incremental as it builds on existing diffusion and OOD methods.

The paper tackles out-of-distribution (OOD) detection by using a diffusion model to compute OOD scores, reporting a 3.5% performance gain with an MAE-based image encoder and outperforming previous methods across the image encoders tested.

Out-of-distribution (OOD) detection is a crucial part of deploying machine learning models safely, and it has been studied extensively, with a plethora of methods developed in the literature. The problem is typically tackled by computing an OOD score; however, previous methods make limited use of the in-distribution dataset when computing it. For instance, the OOD scores are computed with information from only a small portion of the in-distribution data. Furthermore, these methods encode images with a neural image encoder, yet their robustness is rarely checked with respect to image encoders of different training methods and architectures. In this work, we introduce the diffusion process into the OOD task. The diffusion model integrates information from the whole training set into the predicted noise vectors. In addition, we derive a closed-form solution for the noise vector (stable point). The noise vector is then converted into our OOD score, and we test both the noise vector predicted by the deep model and the closed-form noise vector on the OpenOOD benchmarks. Our method outperforms previous OOD methods across all types of image encoders (see the main results table). A 3.5% performance gain is achieved with the MAE-based image encoder. Moreover, we study the robustness of OOD methods by applying different types of image encoders. Some OOD methods fail to generalize well when the image encoder is switched from ResNet to Vision Transformers, whereas our method exhibits good robustness with all the image encoders.
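
The idea of turning a noise vector into an OOD score can be illustrated with a toy sketch. It is not the paper's method or its closed-form solution. It assumes, purely for illustration, that the in-distribution embeddings are Gaussian, in which case the optimal noise prediction for a single forward-diffusion step has a standard closed form that depends on the mean and covariance of the whole training set. The function names, the noise level `alpha_bar`, and the synthetic embeddings are all hypothetical.

```python
import numpy as np

def fit_gaussian(train_emb):
    """Summarize the whole training set by its mean and covariance (assumption: Gaussian embeddings)."""
    mu = train_emb.mean(axis=0)
    sigma = np.cov(train_emb, rowvar=False)
    return mu, sigma

def predicted_noise(x_t, mu, sigma, alpha_bar):
    """Optimal noise estimate E[eps | x_t] when x_0 ~ N(mu, sigma) and
    x_t = sqrt(alpha_bar) * x_0 + sqrt(1 - alpha_bar) * eps."""
    d = mu.shape[0]
    cov_t = alpha_bar * sigma + (1.0 - alpha_bar) * np.eye(d)
    resid = x_t - np.sqrt(alpha_bar) * mu
    # Solve cov_t @ z = resid for every row of resid.
    return np.sqrt(1.0 - alpha_bar) * np.linalg.solve(cov_t, resid.T).T

def ood_score(emb, mu, sigma, alpha_bar=0.5, seed=0):
    """Noise an embedding once, estimate the noise, and use its L2 norm as the score."""
    rng = np.random.default_rng(seed)
    eps = rng.normal(size=emb.shape)
    x_t = np.sqrt(alpha_bar) * emb + np.sqrt(1.0 - alpha_bar) * eps
    return np.linalg.norm(predicted_noise(x_t, mu, sigma, alpha_bar), axis=-1)

# Synthetic stand-ins for image-encoder embeddings (hypothetical data).
rng = np.random.default_rng(42)
train = rng.normal(loc=1.0, scale=0.5, size=(2000, 8))
in_dist = rng.normal(loc=1.0, scale=0.5, size=(200, 8))
out_dist = rng.normal(loc=-3.0, scale=0.5, size=(200, 8))

mu, sigma = fit_gaussian(train)
in_scores = ood_score(in_dist, mu, sigma)
out_scores = ood_score(out_dist, mu, sigma)
print("mean in-distribution score:", in_scores.mean())
print("mean out-of-distribution score:", out_scores.mean())
```

In this toy setting, samples far from the training distribution yield a larger estimated noise norm, so a higher score indicates a more likely OOD input. A learned diffusion model would replace the Gaussian closed form when the embeddings are not well described by a single Gaussian.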
