LGDMNEMLOct 11, 2017

StackSeq2Seq: Dual Encoder Seq2Seq Recurrent Networks

arXiv:1710.04211v21 citations
Originality Incremental advance
AI Analysis

This work addresses graph routing challenges, but it appears incremental as it builds on existing Seq2Seq models with modifications.

The paper tackles the NP-hard problem of finding routes between graph nodes by proposing a dual encoder Seq2Seq recurrent network architecture, which increases accuracy in learning shortest routes and boosts performance through loss function smoothing.

A widely studied non-deterministic polynomial time (NP) hard problem lies in finding a route between the two nodes of a graph. Often meta-heuristics algorithms such as $A^{*}$ are employed on graphs with a large number of nodes. Here, we propose a deep recurrent neural network architecture based on the Sequence-2-Sequence (Seq2Seq) model, widely used, for instance in text translation. Particularly, we illustrate that utilising a context vector that has been learned from two different recurrent networks enables increased accuracies in learning the shortest route of a graph. Additionally, we show that one can boost the performance of the Seq2Seq network by smoothing the loss function using a homotopy continuation of the decoder's loss function.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes