MLJan 18, 2018

Overpruning in Variational Bayesian Neural Networks

arXiv:1801.06230v118.654 citations

Originality Incremental advance

AI Analysis

This addresses a counter-intuitive performance issue in variational Bayesian neural networks, which is incremental but clarifies a key design challenge.

The paper identifies variational over-pruning as a cause of worse predictions with more expressive variational approximations in neural networks, and provides a theoretical explanation for this phenomenon.

The motivations for using variational inference (VI) in neural networks differ significantly from those in latent variable models. This has a counter-intuitive consequence; more expressive variational approximations can provide significantly worse predictions as compared to those with less expressive families. In this work we make two contributions. First, we identify a cause of this performance gap, variational over-pruning. Second, we introduce a theoretically grounded explanation for this phenomenon. Our perspective sheds light on several related published results and provides intuition into the design of effective variational approximations of neural networks.

View on arXiv PDF

Similar