LGCVApr 6, 2025

Active Learning with a Noisy Annotator

arXiv:2504.04506v14 citationsh-index: 46
Originality Incremental advance
AI Analysis

This addresses the issue of noisy labels in active learning for computer vision, but it is incremental as it extends existing coverage-based methods.

The paper tackles the problem of active learning struggling with noisy annotations in low-budget regimes by proposing Noise-Aware Active Sampling (NAS), which improves performance on benchmarks like CIFAR100 and ImageNet subsets.

Active Learning (AL) aims to reduce annotation costs by strategically selecting the most informative samples for labeling. However, most active learning methods struggle in the low-budget regime where only a few labeled examples are available. This issue becomes even more pronounced when annotators provide noisy labels. A common AL approach for the low- and mid-budget regimes focuses on maximizing the coverage of the labeled set across the entire dataset. We propose a novel framework called Noise-Aware Active Sampling (NAS) that extends existing greedy, coverage-based active learning strategies to handle noisy annotations. NAS identifies regions that remain uncovered due to the selection of noisy representatives and enables resampling from these areas. We introduce a simple yet effective noise filtering approach suitable for the low-budget regime, which leverages the inner mechanism of NAS and can be applied for noise filtering before model training. On multiple computer vision benchmarks, including CIFAR100 and ImageNet subsets, NAS significantly improves performance for standard active learning methods across different noise types and rates.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes