LGMLDec 30, 2024

Rethinking Aleatoric and Epistemic Uncertainty

MicrosoftOxford
arXiv:2412.20892v330 citationsh-index: 28ICML
Originality Incremental advance
AI Analysis

This work addresses foundational conceptual issues in uncertainty quantification for machine learning researchers, offering a framework to support clearer thinking in the field.

The paper identifies incoherence in existing discussions of aleatoric and epistemic uncertainty, arguing that these concepts are insufficiently expressive, and proposes a decision-theoretic perspective to clarify uncertainty, predictive performance, and statistical dispersion, while critiquing information-theoretic quantities as poor estimators but useful for data acquisition.

The ideas of aleatoric and epistemic uncertainty are widely used to reason about the probabilistic predictions of machine-learning models. We identify incoherence in existing discussions of these ideas and suggest this stems from the aleatoric-epistemic view being insufficiently expressive to capture all the distinct quantities that researchers are interested in. To address this we present a decision-theoretic perspective that relates rigorous notions of uncertainty, predictive performance and statistical dispersion in data. This serves to support clearer thinking as the field moves forward. Additionally we provide insights into popular information-theoretic quantities, showing they can be poor estimators of what they are often purported to measure, while also explaining how they can still be useful in guiding data acquisition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes