LGAIJun 1, 2025

Principled Input-Output-Conditioned Post-Hoc Uncertainty Estimation for Regression Networks

arXiv:2506.00918v11 citationsh-index: 4Has Code
Originality Highly original
AI Analysis

This addresses the need for reliable uncertainty quantification in safety-sensitive applications, offering a practical solution for retrofitting uncertainty estimates to off-the-shelf neural networks.

The paper tackles the problem of post-hoc uncertainty estimation for regression networks by proposing a framework that uses inputs and frozen model outputs to fit an auxiliary model, achieving enhanced out-of-distribution detection and metric performance without requiring model parameters or gradients.

Uncertainty quantification is critical in safety-sensitive applications but is often omitted from off-the-shelf neural networks due to adverse effects on predictive performance. Retrofitting uncertainty estimates post-hoc typically requires access to model parameters or gradients, limiting feasibility in practice. We propose a theoretically grounded framework for post-hoc uncertainty estimation in regression tasks by fitting an auxiliary model to both original inputs and frozen model outputs. Drawing from principles of maximum likelihood estimation and sequential parameter fitting, we formalize an exact post-hoc optimization objective that recovers the canonical MLE of Gaussian parameters, without requiring sampling or approximation at inference. While prior work has used model outputs to estimate uncertainty, we explicitly characterize the conditions under which this is valid and demonstrate the extent to which structured outputs can support quasi-epistemic inference. We find that using diverse auxiliary data, such as augmented subsets of the original training data, significantly enhances OOD detection and metric performance. Our hypothesis that frozen model outputs contain generalizable latent information about model error and predictive uncertainty is tested and confirmed. Finally, we ensure that our method maintains proper estimation of input-dependent uncertainty without relying exclusively on base model forecasts. These findings are demonstrated in toy problems and adapted to both UCI and depth regression benchmarks. Code: https://github.com/biggzlar/IO-CUE.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes