MLJan 18, 2018

Overpruning in Variational Bayesian Neural Networks

arXiv:1801.06230v154 citations
Originality Incremental advance
AI Analysis

This addresses a counter-intuitive performance issue in variational Bayesian neural networks, which is incremental but clarifies a key design challenge.

The paper identifies variational over-pruning as a cause of worse predictions with more expressive variational approximations in neural networks, and provides a theoretical explanation for this phenomenon.

The motivations for using variational inference (VI) in neural networks differ significantly from those in latent variable models. This has a counter-intuitive consequence; more expressive variational approximations can provide significantly worse predictions as compared to those with less expressive families. In this work we make two contributions. First, we identify a cause of this performance gap, variational over-pruning. Second, we introduce a theoretically grounded explanation for this phenomenon. Our perspective sheds light on several related published results and provides intuition into the design of effective variational approximations of neural networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes