Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision
This addresses the problem of enabling small UAVs with limited sensors to navigate crowded environments autonomously, representing an incremental improvement in reinforcement learning applications for robotics.
The paper tackles autonomous obstacle avoidance for quadrotors using monocular vision, proposing a two-stage framework with unsupervised deep learning for sensing and dueling double deep recurrent Q-learning for decision-making, achieving a high success rate in simulations without prior environment information or labeled datasets.
The rapid development of unmanned aerial vehicles (UAV) puts forward a higher requirement for autonomous obstacle avoidance. Due to the limited payload and power supply, small UAVs such as quadrotors usually carry simple sensors and computation units, which makes traditional methods more challenging to implement. In this paper, a novel framework is demonstrated to control a quadrotor flying through crowded environments autonomously with monocular vision. The framework adopts a two-stage architecture, consisting of a sensing module and a decision module. The sensing module is based on an unsupervised deep learning method. And the decision module uses dueling double deep recurrent Q-learning to eliminate the adverse effects of limited observation capacity of an on-board monocular camera. The framework enables the quadrotor to realize autonomous obstacle avoidance without any prior environment information or labeled datasets for training. The trained model shows a high success rate in the simulation and a good generalization ability for transformed scenarios.