LGAICVMar 16, 2024

Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion

arXiv:2403.10803v12 citationsh-index: 2CAI
Originality Incremental advance
AI Analysis

This addresses the challenge of reliable model deployment in open environments for ML practitioners, but it is incremental as it builds on existing distance-based methods.

The paper tackles the problem of detecting out-of-distribution samples in machine learning by proposing a novel framework that uses multitesting-based layer-wise feature fusion, resulting in a significant reduction in false positive rate from 24.09% to 7.47% on CIFAR10.

Deploying machine learning in open environments presents the challenge of encountering diverse test inputs that differ significantly from the training data. These out-of-distribution samples may exhibit shifts in local or global features compared to the training distribution. The machine learning (ML) community has responded with a number of methods aimed at distinguishing anomalous inputs from original training data. However, the majority of previous studies have primarily focused on the output layer or penultimate layer of pre-trained deep neural networks. In this paper, we propose a novel framework, Multitesting-based Layer-wise Out-of-Distribution (OOD) Detection (MLOD), to identify distributional shifts in test samples at different levels of features through rigorous multiple testing procedure. Our approach distinguishes itself from existing methods as it does not require modifying the structure or fine-tuning of the pre-trained classifier. Through extensive experiments, we demonstrate that our proposed framework can seamlessly integrate with any existing distance-based inspection method while efficiently utilizing feature extractors of varying depths. Our scheme effectively enhances the performance of out-of-distribution detection when compared to baseline methods. In particular, MLOD-Fisher achieves superior performance in general. When trained using KNN on CIFAR10, MLOD-Fisher significantly lowers the false positive rate (FPR) from 24.09% to 7.47% on average compared to merely utilizing the features of the last layer.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes