LGAIRODec 16, 2021

Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation

arXiv:2112.10593v121 citations
Originality Incremental advance
AI Analysis

This provides a benchmark for safe reinforcement learning in aquatic navigation, though it appears incremental in combining existing techniques.

The authors tackled the problem of safe deep reinforcement learning for aquatic navigation by proposing a crossover-based training strategy and verification method, achieving improved sample efficiency over prior approaches and quantifying safety violations through interval analysis.

We propose a novel benchmark environment for Safe Reinforcement Learning focusing on aquatic navigation. Aquatic navigation is an extremely challenging task due to the non-stationary environment and the uncertainties of the robotic platform, hence it is crucial to consider the safety aspect of the problem, by analyzing the behavior of the trained network to avoid dangerous situations (e.g., collisions). To this end, we consider a value-based and policy-gradient Deep Reinforcement Learning (DRL) and we propose a crossover-based strategy that combines gradient-based and gradient-free DRL to improve sample-efficiency. Moreover, we propose a verification strategy based on interval analysis that checks the behavior of the trained models over a set of desired properties. Our results show that the crossover-based training outperforms prior DRL approaches, while our verification allows us to quantify the number of configurations that violate the behaviors that are described by the properties. Crucially, this will serve as a benchmark for future research in this domain of applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes