CVJul 30, 2024

dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple Humans

Marek Herde, Denis Huseljic, Lukas Rauch, Bernhard Sick

arXiv:2407.20950v16.55 citationsh-index: 9Has Code

Originality Synthesis-oriented

AI Analysis

This provides a dataset for evaluating methods to counteract noisy annotations, addressing a common issue in machine learning, though it is incremental as it focuses on a specific domain of animal classification.

The authors tackled the problem of noisy human annotations impairing machine learning generalization by introducing dopanim, a benchmark dataset of about 15,750 animal images with 15 classes, where 20 humans provided over 52,000 annotations with approximately 67% accuracy.

Human annotators typically provide annotated data for training machine learning models, such as neural networks. Yet, human annotations are subject to noise, impairing generalization performances. Methodological research on approaches counteracting noisy annotations requires corresponding datasets for a meaningful empirical evaluation. Consequently, we introduce a novel benchmark dataset, dopanim, consisting of about 15,750 animal images of 15 classes with ground truth labels. For approximately 10,500 of these images, 20 humans provided over 52,000 annotations with an accuracy of circa 67%. Its key attributes include (1) the challenging task of classifying doppelganger animals, (2) human-estimated likelihoods as annotations, and (3) annotator metadata. We benchmark well-known multi-annotator learning approaches using seven variants of this dataset and outline further evaluation use cases such as learning beyond hard class labels and active learning. Our dataset and a comprehensive codebase are publicly available to emulate the data collection process and to reproduce all empirical results.

View on arXiv PDF Code

Similar