ML LGJul 19, 2015

Fast Adaptive Weight Noise

Justin Bayer, Maximilian Karl, Daniela Korhammer, Patrick van der Smagt

arXiv:1507.05331v15.13 citations

Originality Incremental advance

AI Analysis

This work addresses the need for faster and more stable Bayesian approximations in neural networks, though it is incremental as it builds on existing fast dropout techniques.

The authors tackled the problem of efficiently marginalizing uncertain quantities in neural networks by generalizing fast dropout to various noise processes, enabling efficient computation of marginal likelihood and predictive distribution without sampling. This approach achieved results competitive with previous neural network methods and Gaussian processes on multiple regression tasks.

Marginalising out uncertain quantities within the internal representations or parameters of neural networks is of central importance for a wide range of learning techniques, such as empirical, variational or full Bayesian methods. We set out to generalise fast dropout (Wang & Manning, 2013) to cover a wider variety of noise processes in neural networks. This leads to an efficient calculation of the marginal likelihood and predictive distribution which evades sampling and the consequential increase in training time due to highly variant gradient estimates. This allows us to approximate variational Bayes for the parameters of feed-forward neural networks. Inspired by the minimum description length principle, we also propose and experimentally verify the direct optimisation of the regularised predictive distribution. The methods yield results competitive with previous neural network based approaches and Gaussian processes on a wide range of regression tasks.

View on arXiv PDF

Similar