LGITDec 1, 2016

Deep Variational Information Bottleneck

arXiv:1612.00410v72174 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing model reliability for machine learning practitioners, though it is incremental as it builds on an existing theoretical framework.

The paper tackles the problem of improving generalization and robustness in neural networks by introducing a variational approximation to the information bottleneck, resulting in models that outperform others in these areas.

We present a variational approximation to the information bottleneck of Tishby et al. (1999). This variational approach allows us to parameterize the information bottleneck model using a neural network and leverage the reparameterization trick for efficient training. We call this method "Deep Variational Information Bottleneck", or Deep VIB. We show that models trained with the VIB objective outperform those that are trained with other forms of regularization, in terms of generalization performance and robustness to adversarial attack.

Code Implementations9 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes