ROAILGMLMar 19, 2020

Learning to Fly via Deep Model-Based Reinforcement Learning

arXiv:2003.08876v340 citations
AI Analysis

This addresses the challenge of applying reinforcement learning to real-time robot control with minimal real-world interaction, enabling autonomous drone operation without prior knowledge.

The authors tackled the problem of learning to control a quadrotor drone without engineered models by using a learnt probabilistic model of dynamics and model-based reinforcement learning, achieving flight with less than 30 minutes of experience using only onboard resources.

Learning to control robots without requiring engineered models has been a long-term goal, promising diverse and novel applications. Yet, reinforcement learning has only achieved limited impact on real-time robot control due to its high demand of real-world interactions. In this work, by leveraging a learnt probabilistic model of drone dynamics, we learn a thrust-attitude controller for a quadrotor through model-based reinforcement learning. No prior knowledge of the flight dynamics is assumed; instead, a sequential latent variable model, used generatively and as an online filter, is learnt from raw sensory input. The controller and value function are optimised entirely by propagating stochastic analytic gradients through generated latent trajectories. We show that "learning to fly" can be achieved with less than 30 minutes of experience with a single drone, and can be deployed solely using onboard computational resources and sensors, on a self-built drone.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes