LGFeb 25, 2016

Hierarchical Conflict Propagation: Sequence Learning in a Recurrent Deep Neural Network

arXiv:1602.08118v1
Originality Highly original
AI Analysis

This addresses a known bottleneck in RNN training for sequence learning, offering a potential improvement over gradient descent methods.

The authors tackled the problem of training recurrent neural networks (RNNs) to learn long-term dependencies by introducing a novel method using parallel cloned networks with hierarchical conflict propagation, demonstrating its effectiveness on a character-level deep RNN tasked with memorizing a paragraph from Moby Dick.

Recurrent neural networks (RNN) are capable of learning to encode and exploit activation history over an arbitrary timescale. However, in practice, state of the art gradient descent based training methods are known to suffer from difficulties in learning long term dependencies. Here, we describe a novel training method that involves concurrent parallel cloned networks, each sharing the same weights, each trained at different stimulus phase and each maintaining independent activation histories. Training proceeds by recursively performing batch-updates over the parallel clones as activation history is progressively increased. This allows conflicts to propagate hierarchically from short-term contexts towards longer-term contexts until they are resolved. We illustrate the parallel clones method and hierarchical conflict propagation with a character-level deep RNN tasked with memorizing a paragraph of Moby Dick (by Herman Melville).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes