MLFeb 18, 2015

Probabilistic Backpropagation for Scalable Learning of Bayesian Neural Networks

arXiv:1502.05336v21014 citations
AI Analysis

This work addresses the problem of making Bayesian neural networks scalable for practitioners dealing with large datasets and network sizes, representing a novel method for a known bottleneck.

The authors tackled the scalability limitations of Bayesian neural networks by introducing probabilistic backpropagation (PBP), a method that is significantly faster than existing techniques while offering competitive predictive performance on ten real-world datasets.

Large multilayer neural networks trained with backpropagation have recently achieved state-of-the-art results in a wide range of problems. However, using backprop for neural net learning still has some disadvantages, e.g., having to tune a large number of hyperparameters to the data, lack of calibrated probabilistic predictions, and a tendency to overfit the training data. In principle, the Bayesian approach to learning neural networks does not have these problems. However, existing Bayesian techniques lack scalability to large dataset and network sizes. In this work we present a novel scalable method for learning Bayesian neural networks, called probabilistic backpropagation (PBP). Similar to classical backpropagation, PBP works by computing a forward propagation of probabilities through the network and then doing a backward computation of gradients. A series of experiments on ten real-world datasets show that PBP is significantly faster than other techniques, while offering competitive predictive abilities. Our experiments also show that PBP provides accurate estimates of the posterior variance on the network weights.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes