DSAICLJun 3, 2025

Labelling Data with Unknown References

arXiv:2506.03083v31 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses a critical issue for researchers and practitioners in machine learning who rely on evaluators without access to labeled data, though it is incremental as it builds on existing trustworthiness concepts.

The paper tackles the problem of establishing trust in evaluators when labeled reference data is unavailable, introducing the No-Data Algorithm that uses successive challenges to verify trustworthiness with high probability, and demonstrates applications to LLMs-as-judges in low-resource languages.

An evaluator is trustworthy when there exists some agreed-upon way to measure its performance as a labeller. The two ways to establish trustworthiness are either by testing it, or by assuming the evaluator `knows' somehow the way to label the corpus. However, if labelled references (e.g., a development set) are unavailable, neither of these approaches work: the former requires the data, and the latter is an assumption, not evidence. To address this, we introduce an algorithm (the `No-Data Algorithm') by which to establish trust in an evaluator without any existing references. Our algorithm works by successively posing challenges to said evaluator. We show that this is sufficient to establish trustworthiness w.h.p., in such a way that when the evaluator actually knows the way to label the corpus, the No-Data Algorithm accepts its output; and, conversely, flags untrustworthy evaluators when these are unable to prove it. We present formal proofs of correctness, empirical tests, and applications to LLMs-as-judges on low-resource languages.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes