NELGMLJun 12, 2013

Understanding Dropout: Training Multi-Layer Perceptrons with Auxiliary Independent Stochastic Neurons

arXiv:1306.2801v47.57 citations
Originality Incremental advance
AI Analysis

This work addresses training efficiency and regularization for neural networks, but it is incremental as it builds on existing dropout methods.

The paper tackles the problem of improving training in multi-layer perceptrons by proposing a method that adds auxiliary stochastic neurons, generalizing dropout and related techniques, and finds that using different dropping probabilities per layer is empirically viable.

In this paper, a simple, general method of adding auxiliary stochastic neurons to a multi-layer perceptron is proposed. It is shown that the proposed method is a generalization of recently successful methods of dropout (Hinton et al., 2012), explicit noise injection (Vincent et al., 2010; Bishop, 1995) and semantic hashing (Salakhutdinov & Hinton, 2009). Under the proposed framework, an extension of dropout which allows using separate dropping probabilities for different hidden neurons, or layers, is found to be available. The use of different dropping probabilities for hidden layers separately is empirically investigated.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes