Self-Supervised Laplace Approximation for Bayesian Uncertainty Quantification
This work provides a deterministic, sampling-free approximation for Bayesian predictive uncertainty that is computationally efficient and applicable to various regression models, offering a practical tool for uncertainty quantification.
The authors propose Self-Supervised Laplace Approximation (SSLA) to directly approximate the posterior predictive distribution for Bayesian uncertainty quantification, bypassing the parameter posterior. Their method outperforms classical Laplace approximations in predictive calibration across regression tasks while remaining computationally efficient.
Approximate Bayesian inference typically revolves around computing the posterior parameter distribution. In practice, however, the main object of interest is often a model's predictions rather than its parameters. In this work, we propose to bypass the parameter posterior and focus directly on approximating the posterior predictive distribution. We achieve this by drawing inspiration from self-training within self-supervised and semi-supervised learning. Essentially, we quantify a Bayesian model's predictive uncertainty by refitting on self-predicted data. The idea is strikingly simple: If a model assigns high likelihood to self-predicted data, these predictions are of low uncertainty, and vice versa. This yields a deterministic, sampling-free approximation of the posterior predictive. The modular structure of our Self-Supervised Laplace Approximation (SSLA) further allows us to plug in different prior specifications, enabling classical Bayesian sensitivity (w.r.t. prior choice) analysis. In order to bypass expensive refitting, we further introduce an approximate version of SSLA, called ASSLA. We study (A)SSLA both theoretically and empirically in regression models ranging from Bayesian linear models to Bayesian neural networks. Across a wide array of regression tasks with simulated and real-world datasets, our methods outperform classical Laplace approximations in predictive calibration while remaining computationally efficient.