ML LGJul 2, 2024

The Epistemic Uncertainty Hole: an issue of Bayesian Neural Networks

arXiv:2407.01985v115.19 citationsh-index: 2

Originality Incremental advance

AI Analysis

This reveals a critical flaw in Bayesian Deep Learning that affects its reliability for uncertainty-aware applications, making it an incremental but important finding.

The paper identifies that Bayesian Neural Networks exhibit an 'epistemic uncertainty hole', where epistemic uncertainty collapses with large models or small training data, contrary to theoretical expectations, undermining their practical applications like out-of-distribution detection.

Bayesian Deep Learning (BDL) gives access not only to aleatoric uncertainty, as standard neural networks already do, but also to epistemic uncertainty, a measure of confidence a model has in its own predictions. In this article, we show through experiments that the evolution of epistemic uncertainty metrics regarding the model size and the size of the training set, goes against theoretical expectations. More precisely, we observe that the epistemic uncertainty collapses literally in the presence of large models and sometimes also of little training data, while we expect the exact opposite behaviour. This phenomenon, which we call "epistemic uncertainty hole", is all the more problematic as it undermines the entire applicative potential of BDL, which is based precisely on the use of epistemic uncertainty. As an example, we evaluate the practical consequences of this uncertainty hole on one of the main applications of BDL, namely the detection of out-of-distribution samples

View on arXiv PDF

Similar