LGMLJul 14, 2021

Hybrid Bayesian Neural Networks with Functional Probabilistic Layers

arXiv:2107.07014v12 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of encoding intuitive prior knowledge in Bayesian neural networks for researchers in probabilistic deep learning, though it appears incremental as it builds on existing hybrid and Gaussian process methods.

The authors tackled the problem of specifying priors in Bayesian neural networks by proposing hybrid networks with functional probabilistic layers, which encode uncertainty on functions rather than weights, and demonstrated their approach through proof-of-concept experiments using a new library called GPflus.

Bayesian neural networks provide a direct and natural way to extend standard deep neural networks to support probabilistic deep learning through the use of probabilistic layers that, traditionally, encode weight (and bias) uncertainty. In particular, hybrid Bayesian neural networks utilize standard deterministic layers together with few probabilistic layers judicially positioned in the networks for uncertainty estimation. A major aspect and benefit of Bayesian inference is that priors, in principle, provide the means to encode prior knowledge for use in inference and prediction. However, it is difficult to specify priors on weights since the weights have no intuitive interpretation. Further, the relationships of priors on weights to the functions computed by networks are difficult to characterize. In contrast, functions are intuitive to interpret and are direct since they map inputs to outputs. Therefore, it is natural to specify priors on functions to encode prior knowledge, and to use them in inference and prediction based on functions. To support this, we propose hybrid Bayesian neural networks with functional probabilistic layers that encode function (and activation) uncertainty. We discuss their foundations in functional Bayesian inference, functional variational inference, sparse Gaussian processes, and sparse variational Gaussian processes. We further perform few proof-of-concept experiments using GPflus, a new library that provides Gaussian process layers and supports their use with deterministic Keras layers to form hybrid neural network and Gaussian process models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes