MLLGSTOct 13, 2023

The surrogate Gibbs-posterior of a corrected stochastic MALA: Towards uncertainty quantification for neural networks

arXiv:2310.09335v21 citationsh-index: 92
Originality Incremental advance
AI Analysis

This work addresses uncertainty quantification for neural networks, offering a scalable method with theoretical guarantees, though it is incremental as it builds on existing stochastic MALA approaches.

The paper tackles the problem of scaling Bayesian uncertainty quantification for neural networks by introducing a corrected stochastic MALA (csMALA) that reduces the distance between a surrogate posterior and the original Gibbs-posterior while maintaining scalability, achieving optimal contraction rates for deep neural networks in simulations.

MALA is a popular gradient-based Markov chain Monte Carlo method to access the Gibbs-posterior distribution. Stochastic MALA (sMALA) scales to large data sets, but changes the target distribution from the Gibbs-posterior to a surrogate posterior which only exploits a reduced sample size. We introduce a corrected stochastic MALA (csMALA) with a simple correction term for which distance between the resulting surrogate posterior and the original Gibbs-posterior decreases in the full sample size while retaining scalability. In a nonparametric regression model, we prove a PAC-Bayes oracle inequality for the surrogate posterior. Uncertainties can be quantified by sampling from the surrogate posterior. Focusing on Bayesian neural networks, we analyze the diameter and coverage of credible balls for shallow neural networks and we show optimal contraction rates for deep neural networks. Our credibility result is independent of the correction and can also be applied to the standard Gibbs-posterior. A simulation study in a high-dimensional parameter space demonstrates that an estimator drawn from csMALA based on its surrogate Gibbs-posterior indeed exhibits these advantages in practice.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes