Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty
This addresses the problem of inefficient training in transfer learning for robotics by enabling zero-shot transfer, though it is incremental as it builds on existing graph neural network and reinforcement learning methods.
The paper tackles autonomous exploration under localization uncertainty for mobile robots by proposing a framework that learns an exploration policy in a single simulation environment and transfers it zero-shot to other environments, achieving real-time and scalable decision-making without additional training.
This paper studies the problem of autonomous exploration under localization uncertainty for a mobile robot with 3D range sensing. We present a framework for self-learning a high-performance exploration policy in a single simulation environment, and transferring it to other environments, which may be physical or virtual. Recent work in transfer learning achieves encouraging performance by domain adaptation and domain randomization to expose an agent to scenarios that fill the inherent gaps in sim2sim and sim2real approaches. However, it is inefficient to train an agent in environments with randomized conditions to learn the important features of its current state. An agent can use domain knowledge provided by human experts to learn efficiently. We propose a novel approach that uses graph neural networks in conjunction with deep reinforcement learning, enabling decision-making over graphs containing relevant exploration information provided by human experts to predict a robot's optimal sensing action in belief space. The policy, which is trained only in a single simulation environment, offers a real-time, scalable, and transferable decision-making strategy, resulting in zero-shot transfer to other simulation environments and even real-world environments.