MLLGMay 19

Density-Ratio Losses for Post-Hoc Learning to Defer

arXiv:2605.1955752.6
Predicted impact top 25% in ML · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners needing flexible deferral systems, this work provides a principled post-hoc approach that bridges Chow's rule and expert comparison, though it is incremental over existing L2D methods.

The paper introduces density-ratio based losses for post-hoc learning to defer, enabling deferral rate adjustment without retraining. The method achieves competitive performance and robustness across datasets compared to baselines.

We study post-hoc Learning to Defer (L2D) through the lens of ideal distributions: divergence-regularized reweightings of the data distribution under which a model attains low loss. We define deferral via the density-ratio between a model's and an expert's ideals. Using the reduction from density-ratio estimation to class-probability estimation, we derive the DR CPE losses for post-hoc L2D scorers. Deferral decisions are then made by thresholding the scorer, allowing deferral rates to be adjusted without retraining. For KL-based ideal distributions, our deferral rules recovers Chow's rule under the original distribution and a connection to an expert-tilted Bayes posterior -- which incorporates the expert's performance -- depending on if the ideal distributions are joint or marginal distributions. Experimentally, our approach is competitive compared to common baselines and more robust across dataset settings. More broadly, our results cast post-hoc L2D as density-ratio learning between ideal distributions, bridging Chow-style rules, expert comparison, and elucidating connections to related learning settings including anomaly detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes