CLMar 16, 2016

Recurrent Dropout without Memory Loss

Stanislau Semeniuta, Aliaksei Severyn, Erhardt Barth

arXiv:1603.05118v225.6227 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a bottleneck in RNN regularization for NLP applications, but it is incremental as it builds on existing dropout techniques.

The paper tackles the problem of regularizing recurrent neural networks without losing long-term memory by proposing a dropout method applied to recurrent connections, and it shows consistent improvements on NLP benchmarks when combined with conventional dropout.

This paper presents a novel approach to recurrent neural network (RNN) regularization. Differently from the widely adopted dropout method, which is applied to \textit{forward} connections of feed-forward architectures or RNNs, we propose to drop neurons directly in \textit{recurrent} connections in a way that does not cause loss of long-term memory. Our approach is as easy to implement and apply as the regular feed-forward dropout and we demonstrate its effectiveness for Long Short-Term Memory network, the most popular type of RNN cells. Our experiments on NLP benchmarks show consistent improvements even when combined with conventional feed-forward dropout.

View on arXiv PDF Code

Similar