RO LGOct 12, 2022

Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning

Sirui Song, Kirk Saunders, Ye Yue, Jundong Liu

arXiv:2210.06377v14.05 citationsh-index: 10

Originality Incremental advance

AI Analysis

This work addresses collision avoidance for vision-guided autonomous UAV navigation, presenting an incremental improvement by enhancing smoothness and generalization in existing DRL methods.

The paper tackled the problem of smooth trajectory collision avoidance in UAV navigation by proposing novel agent state and reward function designs within a deep reinforcement learning framework, resulting in greatly reduced collision chances and stable flight speed through minimized trajectory derivatives.

Collision avoidance is a crucial task in vision-guided autonomous navigation. Solutions based on deep reinforcement learning (DRL) has become increasingly popular. In this work, we proposed several novel agent state and reward function designs to tackle two critical issues in DRL-based navigation solutions: 1) smoothness of the trained flight trajectories; and 2) model generalization to handle unseen environments. Formulated under a DRL framework, our model relies on margin reward and smoothness constraints to ensure UAVs fly smoothly while greatly reducing the chance of collision. The proposed smoothness reward minimizes a combination of first-order and second-order derivatives of flight trajectories, which can also drive the points to be evenly distributed, leading to stable flight speed. To enhance the agent's capability of handling new unseen environments, two practical setups are proposed to improve the invariance of both the state and reward function when deploying in different scenes. Experiments demonstrate the effectiveness of our overall design and individual components.

View on arXiv PDF

Similar