CVAILGMar 15, 2024

Perceptual Quality-based Model Training under Annotator Label Uncertainty

arXiv:2403.10190v12 citationsh-index: 14Third International Meeting for Applied Geoscience & Energy Expanded Abstracts
Originality Incremental advance
AI Analysis

This work addresses annotator label uncertainty in machine learning, offering a method to enhance model reliability with reduced annotation effort, though it is incremental as it builds on prior multi-annotator approaches.

The paper tackles the problem of model reliability degradation due to low-quality noisy labels from annotator disagreement by introducing a perceptual quality-based training framework that generates multiple labels without requiring massive annotations, resulting in improved generalizability and reduced prediction uncertainty.

Annotators exhibit disagreement during data labeling, which can be termed as annotator label uncertainty. Annotator label uncertainty manifests in variations of labeling quality. Training with a single low-quality annotation per sample induces model reliability degradations. In this work, we first examine the effects of annotator label uncertainty in terms of the model's generalizability and prediction uncertainty. We observe that the model's generalizability and prediction uncertainty degrade with the presence of low-quality noisy labels. Meanwhile, our evaluation of existing uncertainty estimation algorithms indicates their incapability in response to annotator label uncertainty. To mitigate performance degradation, prior methods show that training models with labels collected from multiple independent annotators can enhance generalizability. However, they require massive annotations. Hence, we introduce a novel perceptual quality-based model training framework to objectively generate multiple labels for model training to enhance reliability, while avoiding massive annotations. Specifically, we first select a subset of samples with low perceptual quality scores ranked by statistical regularities of visual signals. We then assign de-aggregated labels to each sample in this subset to obtain a training set with multiple labels. Our experiments and analysis demonstrate that training with the proposed framework alleviates the degradation of generalizability and prediction uncertainty caused by annotator label uncertainty.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes