CVMay 22, 2023

Label Smarter, Not Harder: CleverLabel for Faster Annotation of Ambiguous Image Classification with Higher Quality

arXiv:2305.12811v14 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficiently labeling large datasets with higher quality for machine learning practitioners, though it appears incremental as it builds on proposal-guided annotations.

The paper tackled the problem of costly and ambiguous image classification labeling by proposing CleverLabel, a method that reduces labeling costs by up to 30.0% and improves label quality with a relative Kullback-Leibler divergence gain of up to 29.8% compared to previous state-of-the-art.

High-quality data is crucial for the success of machine learning, but labeling large datasets is often a time-consuming and costly process. While semi-supervised learning can help mitigate the need for labeled data, label quality remains an open issue due to ambiguity and disagreement among annotators. Thus, we use proposal-guided annotations as one option which leads to more consistency between annotators. However, proposing a label increases the probability of the annotators deciding in favor of this specific label. This introduces a bias which we can simulate and remove. We propose a new method CleverLabel for Cost-effective LabEling using Validated proposal-guidEd annotations and Repaired LABELs. CleverLabel can reduce labeling costs by up to 30.0%, while achieving a relative improvement in Kullback-Leibler divergence of up to 29.8% compared to the previous state-of-the-art on a multi-domain real-world image classification benchmark. CleverLabel offers a novel solution to the challenge of efficiently labeling large datasets while also improving the label quality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes