CLMar 16, 2016

Recurrent Dropout without Memory Loss

arXiv:1603.05118v2227 citations
Originality Incremental advance
AI Analysis

This addresses a bottleneck in RNN regularization for NLP applications, but it is incremental as it builds on existing dropout techniques.

The paper tackles the problem of regularizing recurrent neural networks without losing long-term memory by proposing a dropout method applied to recurrent connections, and it shows consistent improvements on NLP benchmarks when combined with conventional dropout.

This paper presents a novel approach to recurrent neural network (RNN) regularization. Differently from the widely adopted dropout method, which is applied to \textit{forward} connections of feed-forward architectures or RNNs, we propose to drop neurons directly in \textit{recurrent} connections in a way that does not cause loss of long-term memory. Our approach is as easy to implement and apply as the regular feed-forward dropout and we demonstrate its effectiveness for Long Short-Term Memory network, the most popular type of RNN cells. Our experiments on NLP benchmarks show consistent improvements even when combined with conventional feed-forward dropout.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes