LGAINEMay 14, 2019

TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture

arXiv:1905.05567v14 citations
Originality Incremental advance
AI Analysis

This addresses the TSP, a widely applicable combinatorial optimization problem, by offering a faster solver with competitive accuracy, though it is incremental in improving existing methods.

The paper tackles the Traveling Salesman Problem (TSP) by proposing TauRieL, a deep reinforcement learning inspired architecture that generates near-optimal solutions two orders of magnitude faster than state-of-the-art offline techniques, with a worst-case performance impact of 6.1%.

In this paper, we propose TauRieL and target Traveling Salesman Problem (TSP) since it has broad applicability in theoretical and applied sciences. TauRieL utilizes an actor-critic inspired architecture that adopts ordinary feedforward nets to obtain a policy update vector $v$. Then, we use $v$ to improve the state transition matrix from which we generate the policy. Also, the state transition matrix allows the solver to initialize from precomputed solutions such as nearest neighbors. In an online learning setting, TauRieL unifies the training and the search where it can generate near-optimal results in seconds. The input to the neural nets in the actor-critic architecture are raw 2-D inputs, and the design idea behind this decision is to keep neural nets relatively smaller than the architectures with wide embeddings with the tradeoff of omitting any distributed representations of the embeddings. Consequently, TauRieL generates TSP solutions two orders of magnitude faster per TSP instance as compared to state-of-the-art offline techniques with a performance impact of 6.1\% in the worst case.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes