MLFeb 18, 2015

Probabilistic Backpropagation for Scalable Learning of Bayesian Neural Networks

José Miguel Hernández-Lobato, Ryan P. Adams

arXiv:1502.05336v245.61014 citations

Originality Highly original

AI Analysis

This work addresses the problem of making Bayesian neural networks scalable for practitioners dealing with large datasets and network sizes, representing a novel method for a known bottleneck.

The authors tackled the scalability limitations of Bayesian neural networks by introducing probabilistic backpropagation (PBP), a method that is significantly faster than existing techniques while offering competitive predictive performance on ten real-world datasets.

Large multilayer neural networks trained with backpropagation have recently achieved state-of-the-art results in a wide range of problems. However, using backprop for neural net learning still has some disadvantages, e.g., having to tune a large number of hyperparameters to the data, lack of calibrated probabilistic predictions, and a tendency to overfit the training data. In principle, the Bayesian approach to learning neural networks does not have these problems. However, existing Bayesian techniques lack scalability to large dataset and network sizes. In this work we present a novel scalable method for learning Bayesian neural networks, called probabilistic backpropagation (PBP). Similar to classical backpropagation, PBP works by computing a forward propagation of probabilities through the network and then doing a backward computation of gradients. A series of experiments on ten real-world datasets show that PBP is significantly faster than other techniques, while offering competitive predictive abilities. Our experiments also show that PBP provides accurate estimates of the posterior variance on the network weights.

View on arXiv PDF

Similar