CVDec 1, 2021

Label-Free Model Evaluation with Semi-Structured Dataset Representations

arXiv:2112.00694v112 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of evaluating models in unseen environments without labels, which is incremental as it builds on prior methods by combining structured and unstructured representations.

The paper tackles the problem of label-free model evaluation (AutoEval) by proposing a semi-structured dataset representation that integrates distribution shapes, clusters, and representative samples to estimate model accuracy on unlabeled test sets, achieving competitive results on three existing and 25 new datasets.

Label-free model evaluation, or AutoEval, estimates model accuracy on unlabeled test sets, and is critical for understanding model behaviors in various unseen environments. In the absence of image labels, based on dataset representations, we estimate model performance for AutoEval with regression. On the one hand, image feature is a straightforward choice for such representations, but it hampers regression learning due to being unstructured (\ie no specific meanings for component at certain location) and of large-scale. On the other hand, previous methods adopt simple structured representations (like average confidence or average feature), but insufficient to capture the data characteristics given their limited dimensions. In this work, we take the best of both worlds and propose a new semi-structured dataset representation that is manageable for regression learning while containing rich information for AutoEval. Based on image features, we integrate distribution shapes, clusters, and representative samples for a semi-structured dataset representation. Besides the structured overall description with distribution shapes, the unstructured description with clusters and representative samples include additional fine-grained information facilitating the AutoEval task. On three existing datasets and 25 newly introduced ones, we experimentally show that the proposed representation achieves competitive results. Code and dataset are available at https://github.com/sxzrt/Semi-Structured-Dataset-Representations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes