MLLGJun 15, 2022

Query-Adaptive Predictive Inference with Partial Labels

arXiv:2206.07236v1h-index: 66
Originality Incremental advance
AI Analysis

This addresses the challenge of high labeling costs in machine learning by enabling model validation with cheaper, partially supervised data, though it is incremental in leveraging existing black-box models.

The paper tackles the problem of constructing predictive sets for structured prediction tasks using only partially labeled data, proposing a method that adapts to various tasks and demonstrates validity in experiments.

The cost and scarcity of fully supervised labels in statistical machine learning encourage using partially labeled data for model validation as a cheaper and more accessible alternative. Effectively collecting and leveraging weakly supervised data for large-space structured prediction tasks thus becomes an important part of an end-to-end learning system. We propose a new computationally-friendly methodology to construct predictive sets using only partially labeled data on top of black-box predictive models. To do so, we introduce "probe" functions as a way to describe weakly supervised instances and define a false discovery proportion-type loss, both of which seamlessly adapt to partial supervision and structured prediction -- ranking, matching, segmentation, multilabel or multiclass classification. Our experiments highlight the validity of our predictive set construction as well as the attractiveness of a more flexible user-dependent loss framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes