Nonparametric Distribution Regression Re-calibration

Ádám Jung, Domokos M. Kelen, András A. Benczúr

arXiv:2602.13362v11.7h-index: 7

Originality Highly original

AI Analysis

This addresses the need for trustworthy uncertainty estimates in safety-critical applications, offering a more flexible alternative to existing post-hoc calibration methods.

The paper tackles the problem of overconfident predictive distributions in probabilistic regression by proposing a nonparametric re-calibration algorithm using conditional kernel mean embeddings, which consistently outperforms prior methods across diverse benchmarks.

A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration, producing narrow but overconfident predictions. However, in safety-critical settings, trustworthy uncertainty estimates are often more valuable than narrow intervals. Realizing the problem, several recent works have focused on post-hoc corrections; however, existing methods either rely on weak notions of calibration (such as PIT uniformity) or impose restrictive parametric assumptions on the nature of the error. To address these limitations, we propose a novel nonparametric re-calibration algorithm based on conditional kernel mean embeddings, capable of correcting calibration error without restrictive modeling assumptions. For efficient inference with real-valued targets, we introduce a novel characteristic kernel over distributions that can be evaluated in $\mathcal{O}(n \log n)$ time for empirical distributions of size $n$. We demonstrate that our method consistently outperforms prior re-calibration approaches across a diverse set of regression benchmarks and model classes.

View on arXiv PDF

Similar