Hybrid Pointer Networks for Traveling Salesman Problems Optimization
This work addresses combinatorial optimization for routing and logistics, but it is incremental as it builds directly on existing graph pointer networks.
The authors tackled the Traveling Salesman Problem by proposing a hybrid pointer network (HPN) that improves upon graph pointer networks, achieving a performance increase from 5.959 to 5.706 on TSP50 without using 2opt or other specialized algorithms.
In this work, a novel idea is presented for combinatorial optimization problems, a hybrid network, which results in a superior outcome. We applied this method to graph pointer networks [1], expanding its capabilities to a higher level. We proposed a hybrid pointer network (HPN) to solve the travelling salesman problem trained by reinforcement learning. Furthermore, HPN builds upon graph pointer networks which is an extension of pointer networks with an additional graph embedding layer. HPN outperforms the graph pointer network in solution quality due to the hybrid encoder, which provides our model with a verity encoding type, allowing our model to converge to a better policy. Our network significantly outperforms the original graph pointer network for small and large-scale problems increasing its performance for TSP50 from 5.959 to 5.706 without utilizing 2opt, Pointer networks, Attention model, and a wide range of models, producing results comparable to highly tuned and specialized algorithms. We make our data, models, and code publicly available [2].