LG Mar 18, 2022

Decision-Making under Miscalibration

arXiv:2203.09852v1, 6 citations, h-index: 43
Originality: Incremental advance
AI Analysis

This paper addresses decision-making under uncertainty in high-stakes domains such as healthcare. The contribution is incremental: it builds on existing calibration theory.

The paper studies how to make binary decisions, such as whether to perform a medical procedure, from potentially miscalibrated machine learning predictions. It proposes a threshold that minimizes worst-case regret under an anticipated level of miscalibration, and validates the approach on real data, showing cases in which clinical utility improves.

ML-based predictions are used to inform consequential decisions about individuals. How should we use predictions (e.g., risk of heart attack) to inform downstream binary classification decisions (e.g., undergoing a medical procedure)? When the risk estimates are perfectly calibrated, the answer is well understood: a classification problem's cost structure induces an optimal treatment threshold $j^{\star}$. In practice, however, some amount of miscalibration is unavoidable, raising a fundamental question: how should one use potentially miscalibrated predictions to inform binary decisions? We formalize a natural (distribution-free) solution concept: given anticipated miscalibration of $\alpha$, we propose using the threshold $j$ that minimizes the worst-case regret over all $\alpha$-miscalibrated predictors, where the regret is the difference in clinical utility between using the threshold in question and using the optimal threshold in hindsight. We provide closed-form expressions for $j$ when miscalibration is measured using both expected and maximum calibration error, which reveal that it indeed differs from $j^{\star}$ (the optimal threshold under perfect calibration). We validate our theoretical findings on real data, demonstrating that there are natural cases in which making decisions using $j$ improves the clinical utility.
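
The minimax-regret idea in the abstract can be illustrated with a small numerical sketch. This is not the paper's closed-form expression and not its distribution-free setting. It assumes a simple cost structure (a hypothetical `benefit` for treating a true positive and `cost` for treating a true negative), under which the calibrated threshold is `cost / (benefit + cost)`. It also restricts the adversary to a toy family of predictors whose true risk is the score shifted by a constant in `[-alpha, alpha]`. All function names and parameter values below are illustrative assumptions.

```python
from typing import List, Sequence


def optimal_threshold(benefit: float, cost: float) -> float:
    """Threshold under perfect calibration: treating pays off iff
    risk * benefit >= (1 - risk) * cost, i.e. risk >= cost / (benefit + cost)."""
    return cost / (benefit + cost)


def utility(threshold: float, scores: Sequence[float], risks: Sequence[float],
            benefit: float, cost: float) -> float:
    """Average expected utility of treating everyone whose score is >= threshold.
    `risks` holds the true outcome probability behind each score."""
    total = 0.0
    for s, p in zip(scores, risks):
        if s >= threshold:
            total += p * benefit - (1.0 - p) * cost
    return total / len(scores)


def worst_case_regrets(scores: Sequence[float], alpha: float, benefit: float,
                       cost: float, candidates: Sequence[float],
                       n_shifts: int = 11) -> List[float]:
    """Worst-case regret of each candidate threshold over a TOY family of
    miscalibrated predictors: true risk = clip(score + delta), |delta| <= alpha.
    This family is an assumption made for illustration (requires n_shifts >= 2)."""
    worst = [0.0] * len(candidates)
    for k in range(n_shifts):
        delta = -alpha + 2.0 * alpha * k / (n_shifts - 1)
        risks = [min(1.0, max(0.0, s + delta)) for s in scores]
        utils = [utility(t, scores, risks, benefit, cost) for t in candidates]
        best = max(utils)  # utility of the best candidate threshold in hindsight
        for i, u in enumerate(utils):
            worst[i] = max(worst[i], best - u)
    return worst


def minimax_threshold(scores: Sequence[float], alpha: float, benefit: float,
                      cost: float, candidates: Sequence[float]) -> float:
    """Candidate threshold with the smallest worst-case regret (brute force)."""
    worst = worst_case_regrets(scores, alpha, benefit, cost, candidates)
    best_index = min(range(len(candidates)), key=lambda i: worst[i])
    return candidates[best_index]


# Hypothetical setup: scores on a uniform grid, thresholds searched on the same grid.
scores = [k / 100 for k in range(101)]
candidates = scores
j_star = optimal_threshold(benefit=4.0, cost=1.0)
j_alpha = minimax_threshold(scores, alpha=0.1, benefit=4.0, cost=1.0,
                            candidates=candidates)
print(f"calibrated threshold: {j_star:.2f}, toy minimax threshold: {j_alpha:.2f}")
```

Because the shift family is far narrower than the set of all $\alpha$-miscalibrated predictors, the threshold this sketch returns need not match the paper's $j$. It only shows the shape of the computation: an inner maximum of regret over predictors and an outer minimum over thresholds.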

