LGCVMLFeb 21, 2025

Logit Disagreement: OoD Detection with Bayesian Neural Networks

arXiv:2502.15648v11 citationsh-index: 1ECCV Workshops
Originality Incremental advance
AI Analysis

This work addresses out-of-distribution detection for machine learning safety, but it is incremental as it builds on existing Bayesian methods with specific refinements.

The paper tackled the problem of out-of-distribution detection using Bayesian neural networks by proposing new epistemic uncertainty scores based on logit disagreement, which showed marked improvements over mutual information and performed on par with predictive entropy in experiments on MNIST and CIFAR10.

Bayesian neural networks (BNNs), which estimate the full posterior distribution over model parameters, are well-known for their role in uncertainty quantification and its promising application in out-of-distribution detection (OoD). Amongst other uncertainty measures, BNNs provide a state-of-the art estimation of predictive entropy (total uncertainty) which can be decomposed as the sum of mutual information and expected entropy. In the context of OoD detection the estimation of predictive uncertainty in the form of the predictive entropy score confounds aleatoric and epistemic uncertainty, the latter being hypothesized to be high for OoD points. Despite these justifications, the mutual information score has been shown to perform worse than predictive entropy. Taking inspiration from Bayesian variational autoencoder (BVAE) literature, this work proposes to measure the disagreement between a corrected version of the pre-softmax quantities, otherwise known as logits, as an estimate of epistemic uncertainty for Bayesian NNs under mean field variational inference. The three proposed epistemic uncertainty scores demonstrate marked improvements over mutual information on a range of OoD experiments, with equal performance otherwise. Moreover, the epistemic uncertainty scores perform on par with the Bayesian benchmark predictive entropy on a range of MNIST and CIFAR10 experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes