CL · AI · IR · NE · Nov 24, 2018

Recurrently Controlled Recurrent Networks

arXiv:1811.09786v1 · 18 citations
Originality: Incremental advance
AI Analysis

This work addresses the need for more expressive sequence encoders in NLP and offers a potential replacement for the widely used stacked recurrent architectures. It appears incremental because it builds on existing RNN frameworks.

The paper tackles the problem of improving sequence encoding in recurrent neural networks by proposing a Recurrently Controlled Recurrent Network (RCRN), which uses a controller cell to learn recurrent gating functions and influence a listener cell. The results show that RCRN consistently outperforms BiLSTMs and stacked BiLSTMs across 26 NLP datasets, including sentiment analysis, question classification, entailment classification, answer selection, and reading comprehension tasks.

Recurrent neural networks (RNNs) such as long short-term memory and gated recurrent units are pivotal building blocks across a broad spectrum of sequence modeling problems. This paper proposes a recurrently controlled recurrent network (RCRN) for expressive and powerful sequence encoding. More concretely, the key idea behind our approach is to learn the recurrent gating functions using recurrent networks. Our architecture is split into two components, a controller cell and a listener cell, whereby the recurrent controller actively influences the compositionality of the listener cell. We conduct extensive experiments on a myriad of tasks in the NLP domain such as sentiment analysis (SST, IMDb, Amazon reviews, etc.), question classification (TREC), entailment classification (SNLI, SciTail), answer selection (WikiQA, TrecQA) and reading comprehension (NarrativeQA). Across all 26 datasets, our results demonstrate that RCRN consistently outperforms not only BiLSTMs but also stacked BiLSTMs, suggesting that our controller architecture might be a suitable replacement for the widely adopted stacked architecture.
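
The controller/listener split described above can be illustrated with a small sketch. This is not the paper's exact formulation: it assumes plain tanh RNNs as the controller, a single forget gate and a single output gate, and a simple convex-combination update in the listener. All names (`make_params`, `controller_states`, `rcrn_sketch`, the `W*` keys) are hypothetical and chosen only for illustration.

```python
import math
import random


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * a for w, a in zip(row, v)) for row in W]


def sigmoid(v):
    """Element-wise logistic function."""
    return [1.0 / (1.0 + math.exp(-a)) for a in v]


def make_params(input_dim, hidden_dim, seed=0):
    """Random weights for the sketch (hypothetical layout, not from the paper)."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    return {
        "Wf_in": mat(hidden_dim, input_dim), "Wf_rec": mat(hidden_dim, hidden_dim),
        "Wo_in": mat(hidden_dim, input_dim), "Wo_rec": mat(hidden_dim, hidden_dim),
        "Wc_in": mat(hidden_dim, input_dim),
    }


def controller_states(xs, W_in, W_rec):
    """Controller cell: a plain tanh RNN (assumed stand-in), one state per step."""
    h = [0.0] * len(W_rec)
    states = []
    for x in xs:
        pre = [a + b for a, b in zip(matvec(W_in, x), matvec(W_rec, h))]
        h = [math.tanh(p) for p in pre]
        states.append(h)
    return states


def rcrn_sketch(xs, params):
    """Listener cell whose gates are produced by recurrent controllers."""
    # The gates come from recurrent states, not from a feed-forward projection.
    forget_gates = [sigmoid(s) for s in
                    controller_states(xs, params["Wf_in"], params["Wf_rec"])]
    output_gates = [sigmoid(s) for s in
                    controller_states(xs, params["Wo_in"], params["Wo_rec"])]

    c = [0.0] * len(params["Wc_in"])  # listener memory
    outputs = []
    for x, f, o in zip(xs, forget_gates, output_gates):
        z = [math.tanh(a) for a in matvec(params["Wc_in"], x)]  # candidate content
        # Assumed update rule: the forget gate blends old memory with new content.
        c = [fi * ci + (1.0 - fi) * zi for fi, ci, zi in zip(f, c, z)]
        outputs.append([oi * ci for oi, ci in zip(o, c)])
    return outputs


params = make_params(input_dim=3, hidden_dim=4)
sequence = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.5]]
encoded = rcrn_sketch(sequence, params)
print(len(encoded), len(encoded[0]))
```

For contrast, a standard LSTM computes each gate as a sigmoid of a linear function of the current input and the previous hidden state. In the sketch, each gate is instead read off the state of a separate recurrent pass over the sequence, which is the sense in which the gating functions are themselves recurrent.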

Code Implementations: 1 repo
