AIFeb 28, 2021

Where the Action is: Let's make Reinforcement Learning for Stochastic Dynamic Vehicle Routing Problems work!

Florentin D Hildebrandt, Barrett Thomas, Marlin W Ulmer

arXiv:2103.00507v16.18 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of real-time urban logistics for service providers, but it is incremental as it focuses on synthesizing existing work rather than proposing new methods.

The paper tackles the challenge of solving stochastic dynamic vehicle routing problems (SDVRPs) by highlighting the need for collaboration between operations research and reinforcement learning communities, as current approaches are insufficiently integrated, and it surveys existing research to guide joint solutions.

There has been a paradigm-shift in urban logistic services in the last years; demand for real-time, instant mobility and delivery services grows. This poses new challenges to logistic service providers as the underlying stochastic dynamic vehicle routing problems (SDVRPs) require anticipatory real-time routing actions. Searching the combinatorial action space for efficient routing actions is by itself a complex task of mixed-integer programming (MIP) well-known by the operations research community. This complexity is now multiplied by the challenge of evaluating such actions with respect to their effectiveness given future dynamism and uncertainty, a potentially ideal case for reinforcement learning (RL) well-known by the computer science community. For solving SDVRPs, joint work of both communities is needed, but as we show, essentially non-existing. Both communities focus on their individual strengths leaving potential for improvement. Our survey paper highlights this potential in research originating from both communities. We point out current obstacles in SDVRPs and guide towards joint approaches to overcome them.

View on arXiv PDF

Similar