LGFeb 4

Probabilistic Label Spreading: Efficient and Consistent Estimation of Soft Labels with Epistemic Uncertainty on Graphs

arXiv:2602.04574v1h-index: 3
Originality Highly original
AI Analysis

This addresses the problem of scalable and reliable label estimation for AI perception tasks, offering a practical solution to reduce annotation costs while improving label quality.

The paper tackles the challenge of estimating aleatoric and epistemic uncertainty in labels for perception tasks, where high-quality annotations are scarce, by introducing a probabilistic label spreading method that reduces the annotation budget needed for desired label quality and achieves state-of-the-art results on the Data-Centric Image Classification benchmark.

Safe artificial intelligence for perception tasks remains a major challenge, partly due to the lack of data with high-quality labels. Annotations themselves are subject to aleatoric and epistemic uncertainty, which is typically ignored during annotation and evaluation. While crowdsourcing enables collecting multiple annotations per image to estimate these uncertainties, this approach is impractical at scale due to the required annotation effort. We introduce a probabilistic label spreading method that provides reliable estimates of aleatoric and epistemic uncertainty of labels. Assuming label smoothness over the feature space, we propagate single annotations using a graph-based diffusion method. We prove that label spreading yields consistent probability estimators even when the number of annotations per data point converges to zero. We present and analyze a scalable implementation of our method. Experimental results indicate that, compared to baselines, our approach substantially reduces the annotation budget required to achieve a desired label quality on common image datasets and achieves a new state of the art on the Data-Centric Image Classification benchmark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes