AIGTHCJun 8, 2025

Evaluating LLM-Contaminated Crowdsourcing Data Without Ground Truth

arXiv:2506.06991v23 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses data quality issues for AI developers relying on crowdsourcing, though it builds incrementally on existing peer prediction techniques.

The paper tackles the problem of LLM contamination in crowdsourced annotation data by developing a peer prediction method that detects cheating without ground truth, demonstrating robustness on real-world datasets.

The recent success of generative AI highlights the crucial role of high-quality human feedback in building trustworthy AI systems. However, the increasing use of large language models (LLMs) by crowdsourcing workers poses a significant challenge: datasets intended to reflect human input may be compromised by LLM-generated responses. Existing LLM detection approaches often rely on high-dimensional training data such as text, making them unsuitable for annotation tasks like multiple-choice labeling. In this work, we investigate the potential of peer prediction -- a mechanism that evaluates the information within workers' responses without using ground truth -- to mitigate LLM-assisted cheating in crowdsourcing with a focus on annotation tasks. Our approach quantifies the correlations between worker answers while conditioning on (a subset of) LLM-generated labels available to the requester. Building on prior research, we propose a training-free scoring mechanism with theoretical guarantees under a crowdsourcing model that accounts for LLM collusion. We establish conditions under which our method is effective and empirically demonstrate its robustness in detecting low-effort cheating on real-world crowdsourcing datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes