LG DM NE MLOct 11, 2017

StackSeq2Seq: Dual Encoder Seq2Seq Recurrent Networks

arXiv:1710.04211v21.41 citations

Originality Incremental advance

AI Analysis

This work addresses graph routing challenges, but it appears incremental as it builds on existing Seq2Seq models with modifications.

The paper tackles the NP-hard problem of finding routes between graph nodes by proposing a dual encoder Seq2Seq recurrent network architecture, which increases accuracy in learning shortest routes and boosts performance through loss function smoothing.

A widely studied non-deterministic polynomial time (NP) hard problem lies in finding a route between the two nodes of a graph. Often meta-heuristics algorithms such as $A^{*}$ are employed on graphs with a large number of nodes. Here, we propose a deep recurrent neural network architecture based on the Sequence-2-Sequence (Seq2Seq) model, widely used, for instance in text translation. Particularly, we illustrate that utilising a context vector that has been learned from two different recurrent networks enables increased accuracies in learning the shortest route of a graph. Additionally, we show that one can boost the performance of the Seq2Seq network by smoothing the loss function using a homotopy continuation of the decoder's loss function.

View on arXiv PDF

Similar