LGApr 12, 2025

crowd-hpo: Realistic Hyperparameter Optimization and Benchmarking for Learning from Crowds with Noisy Labels

arXiv:2504.09085v2h-index: 9Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses the need for fairer and more realistic benchmarking in crowd-sourced learning, though it is incremental as it builds on existing methods by adding a hyperparameter optimization framework.

The paper tackles the problem of unfair and unrealistic hyperparameter tuning in learning from crowds with noisy labels by introducing crowd-hpo, a framework that selects hyperparameter configurations using only noisy crowd-labeled validation data, improving generalization performance on test sets with ground truth labels.

Crowdworking is a cost-efficient solution for acquiring class labels. Since these labels are subject to noise, various approaches to learning from crowds have been proposed. Typically, these approaches are evaluated with default hyperparameter configurations, resulting in unfair and suboptimal performance, or with hyperparameter configurations tuned via a validation set with ground truth class labels, representing an often unrealistic scenario. Moreover, both setups can produce different approach rankings, complicating study comparisons. Therefore, we introduce crowd-hpo as a framework for evaluating approaches to learning from crowds in combination with criteria to select well-performing hyperparameter configurations with access only to noisy crowd-labeled validation data. Extensive experiments with neural networks demonstrate that these criteria select hyperparameter configurations, which improve the learning from crowd approaches' generalization performances, measured on separate test sets with ground truth labels. Hence, incorporating such criteria into experimental studies is essential for enabling fairer and more realistic benchmarking.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes