NELGDec 11, 2014

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

arXiv:1412.3555v114682 citations
Originality Synthesis-oriented
AI Analysis

This work provides empirical validation for using gated recurrent units in sequence modeling, which is important for researchers and practitioners in machine learning and AI, though it is incremental as it builds on existing methods.

The paper tackled the problem of comparing gated recurrent units like LSTM and GRU to traditional units in RNNs for sequence modeling tasks such as polyphonic music and speech signal modeling, finding that gated units outperform traditional ones and GRU is comparable to LSTM.

In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently proposed gated recurrent unit (GRU). We evaluate these recurrent units on the tasks of polyphonic music modeling and speech signal modeling. Our experiments revealed that these advanced recurrent units are indeed better than more traditional recurrent units such as tanh units. Also, we found GRU to be comparable to LSTM.

Code Implementations14 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes