RO AI LG SY OCNov 6, 2021

Robust Deep Reinforcement Learning for Quadcopter Control

Aditya M. Deshpande, Ali A. Minai, Manish Kumar

arXiv:2111.03915v110.423 citationsh-index: 65Has Code

Originality Incremental advance

AI Analysis

This work addresses robustness in drone control for robotics applications, but it is incremental as it combines existing robust control and RL methods.

The paper tackled the problem of poor generalization in deep reinforcement learning policies when transferred between environments by using Robust Markov Decision Processes for quadcopter control, resulting in a robust policy that outperformed standard agents in unseen environments.

Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from Robust Control and RL. It opts for pessimistic optimization to handle potential gaps between policy transfer from one environment to another. The trained control policy is tested on the task of quadcopter positional control. RL agents were trained in a MuJoCo simulator. During testing, different environment parameters (unseen during the training) were used to validate the robustness of the trained policy for transfer from one environment to another. The robust policy outperformed the standard agents in these environments, suggesting that the added robustness increases generality and can adapt to non-stationary environments. Codes: https://github.com/adipandas/gym_multirotor

View on arXiv PDF Code

Similar