LGHCJan 15, 2023

Who Should Predict? Exact Algorithms For Learning to Defer to Humans

MicrosoftMIT
arXiv:2301.06197v286 citationsh-index: 52
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving accuracy in human-AI collaboration systems, offering new algorithms for learning to defer, though it is incremental with specific technical contributions.

The paper tackles the problem of jointly training a classifier and a rejector to decide when to defer predictions to a human, showing that prior methods can fail even in realizable settings and proving NP-hardness for linear pairs. It provides an optimal MILP solution for linear settings and a novel surrogate loss that performs well empirically on multiple datasets.

Automated AI classifiers should be able to defer the prediction to a human decision maker to ensure more accurate predictions. In this work, we jointly train a classifier with a rejector, which decides on each data point whether the classifier or the human should predict. We show that prior approaches can fail to find a human-AI system with low misclassification error even when there exists a linear classifier and rejector that have zero error (the realizable setting). We prove that obtaining a linear pair with low error is NP-hard even when the problem is realizable. To complement this negative result, we give a mixed-integer-linear-programming (MILP) formulation that can optimally solve the problem in the linear setting. However, the MILP only scales to moderately-sized problems. Therefore, we provide a novel surrogate loss function that is realizable-consistent and performs well empirically. We test our approaches on a comprehensive set of datasets and compare to a wide range of baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes