LGMLAug 31, 2019

Epistemic Uncertainty Sampling

arXiv:1909.00218v156 citations
Originality Incremental advance
AI Analysis

This work addresses active learning for machine learning practitioners by introducing a novel uncertainty distinction, but it appears incremental as it builds on existing uncertainty sampling methods.

The paper tackles the problem of active learning by distinguishing between epistemic (reducible) and aleatoric (irreducible) uncertainty, proposing 'epistemic uncertainty sampling' as a better measure for querying instances. Experimental results show promising performance, though no concrete numbers are provided.

Various strategies for active learning have been proposed in the machine learning literature. In uncertainty sampling, which is among the most popular approaches, the active learner sequentially queries the label of those instances for which its current prediction is maximally uncertain. The predictions as well as the measures used to quantify the degree of uncertainty, such as entropy, are almost exclusively of a probabilistic nature. In this paper, we advocate a distinction between two different types of uncertainty, referred to as epistemic and aleatoric, in the context of active learning. Roughly speaking, these notions capture the reducible and the irreducible part of the total uncertainty in a prediction, respectively. We conjecture that, in uncertainty sampling, the usefulness of an instance is better reflected by its epistemic than by its aleatoric uncertainty. This leads us to suggest the principle of "epistemic uncertainty sampling", which we instantiate by means of a concrete approach for measuring epistemic and aleatoric uncertainty. In experimental studies, epistemic uncertainty sampling does indeed show promising performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes