LGMLJun 6, 2024

Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks

arXiv:2406.04317v34 citations
Originality Highly original
AI Analysis

This addresses a fundamental problem in Bayesian deep learning for safety-critical applications by enabling reliable uncertainty modeling with function-space priors.

The paper tackles the issue of infinite objective functions in function-space variational inference for Bayesian neural networks by introducing a regularized KL-divergence approach, resulting in well-defined variational objectives that incorporate Gaussian process priors and provide competitive uncertainty estimates on synthetic and real-world datasets.

Bayesian neural networks (BNN) promise to combine the predictive performance of neural networks with principled uncertainty modeling important for safety-critical systems and decision making. However, posterior uncertainty estimates depend on the choice of prior, and finding informative priors in weight-space has proven difficult. This has motivated variational inference (VI) methods that pose priors directly on the function generated by the BNN rather than on weights. In this paper, we address a fundamental issue with such function-space VI approaches pointed out by Burt et al. (2020), who showed that the objective function (ELBO) is negative infinite for most priors of interest. Our solution builds on generalized VI (Knoblauch et al., 2019) with the regularized KL divergence (Quang, 2019) and is, to the best of our knowledge, the first well-defined variational objective for function-space inference in BNNs with Gaussian process (GP) priors. Experiments show that our method incorporates the properties specified by the GP prior on synthetic and small real-world data sets, and provides competitive uncertainty estimates for regression, classification and out-of-distribution detection compared to BNN baselines with both function and weight-space priors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes