LGMLMay 26, 2022

Selective Prediction via Training Dynamics

U of Toronto
arXiv:2205.13532v424 citationsh-index: 31
Originality Incremental advance
AI Analysis

This addresses the practical challenge of selective prediction for machine learning practitioners by enabling rejection without modifying model architecture or training, though it is incremental as it builds on existing approaches.

The paper tackles the selective prediction problem by proposing a method that uses training dynamics to reject inputs the model would predict incorrectly, achieving state-of-the-art accuracy/utility trade-offs on benchmarks.

Selective Prediction is the task of rejecting inputs a model would predict incorrectly on. This involves a trade-off between input space coverage (how many data points are accepted) and model utility (how good is the performance on accepted data points). Current methods for selective prediction typically impose constraints on either the model architecture or the optimization objective; this inhibits their usage in practice and introduces unknown interactions with pre-existing loss functions. In contrast to prior work, we show that state-of-the-art selective prediction performance can be attained solely from studying the (discretized) training dynamics of a model. We propose a general framework that, given a test input, monitors metrics capturing the instability of predictions from intermediate models (i.e., checkpoints) obtained during training w.r.t. the final model's prediction. In particular, we reject data points exhibiting too much disagreement with the final prediction at late stages in training. The proposed rejection mechanism is domain-agnostic (i.e., it works for both discrete and real-valued prediction) and can be flexibly combined with existing selective prediction approaches as it does not require any train-time modifications. Our experimental evaluation on image classification, regression, and time series problems shows that our method beats past state-of-the-art accuracy/utility trade-offs on typical selective prediction benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes