Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
This addresses the need for reliable autonomous shipping solutions in crowded and variable maritime environments, though it is incremental as it compares existing algorithms in a new domain.
The paper tackled the problem of evaluating the robustness of reinforcement learning algorithms for autonomous shipping in inland waterway transport, finding that Soft-Actor Critic (SAC) is more robust to environmental disturbances than MuZero, successfully navigating port environments not seen during training.
Recently, there has been growing interest in autonomous shipping due to its potential to improve maritime efficiency and safety. The use of advanced technologies, such as artificial intelligence, can address the current navigational and operational challenges in autonomous shipping. In particular, inland waterway transport (IWT) presents a unique set of challenges, such as crowded waterways and variable environmental conditions. In such dynamic settings, the reliability and robustness of autonomous shipping solutions are critical factors for ensuring safe operations. This paper examines the robustness of benchmark deep reinforcement learning (RL) algorithms, implemented for IWT within an autonomous shipping simulator, and their ability to generate effective motion planning policies. We demonstrate that a model-free approach can achieve an adequate policy in the simulator, successfully navigating port environments never encountered during training. We focus particularly on Soft-Actor Critic (SAC), which we show to be inherently more robust to environmental disturbances compared to MuZero, a state-of-the-art model-based RL algorithm. In this paper, we take a significant step towards developing robust, applied RL frameworks that can be generalized to various vessel types and navigate complex port- and inland environments and scenarios.