LGAICVNENov 21, 2016

Generalized Dropout

arXiv:1611.06791v148 citations
Originality Incremental advance
AI Analysis

This work addresses the need for better regularization techniques in deep learning, offering incremental improvements over existing methods like Dropout.

The paper tackles the problem of improving generalization in deep neural networks by introducing Generalized Dropout, a family of regularizers that includes methods like Dropout++ with trainable parameters, and shows that these methods enhance generalization performance compared to standard Dropout.

Deep Neural Networks often require good regularizers to generalize well. Dropout is one such regularizer that is widely used among Deep Learning practitioners. Recent work has shown that Dropout can also be viewed as performing Approximate Bayesian Inference over the network parameters. In this work, we generalize this notion and introduce a rich family of regularizers which we call Generalized Dropout. One set of methods in this family, called Dropout++, is a version of Dropout with trainable parameters. Classical Dropout emerges as a special case of this method. Another member of this family selects the width of neural network layers. Experiments show that these methods help in improving generalization performance over Dropout.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes