LGAINov 18, 2025

Soft-Label Training Preserves Epistemic Uncertainty

arXiv:2511.14117v13 citations
Originality Highly original
AI Analysis

This addresses the misalignment between model certainty and human perception in ambiguous tasks, offering a foundational shift in handling subjective data.

The paper tackles the problem of models expressing false confidence on ambiguous data by proposing soft-label training, which treats annotation distributions as ground truth. The result shows 32% lower KL divergence from human annotations and 61% stronger correlation between model and annotation entropy, while matching hard-label training accuracy.

Many machine learning tasks involve inherent subjectivity, where annotators naturally provide varied labels. Standard practice collapses these label distributions into single labels, aggregating diverse human judgments into point estimates. We argue that this approach is epistemically misaligned for ambiguous data--the annotation distribution itself should be regarded as the ground truth. Training on collapsed single labels forces models to express false confidence on fundamentally ambiguous cases, creating a misalignment between model certainty and the diversity of human perception. We demonstrate empirically that soft-label training, which treats annotation distributions as ground truth, preserves epistemic uncertainty. Across both vision and NLP tasks, soft-label training achieves 32% lower KL divergence from human annotations and 61% stronger correlation between model and annotation entropy, while matching the accuracy of hard-label training. Our work repositions annotation distributions from noisy signals to be aggregated away, to faithful representations of epistemic uncertainty that models should learn to reproduce.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes