LG AIJul 17, 2021

High-Accuracy Model-Based Reinforcement Learning, a Survey

arXiv:2107.08241v113.154 citations

Originality Synthesis-oriented

AI Analysis

It addresses the high sample complexity problem in reinforcement learning for researchers and practitioners, but is incremental as it reviews existing methods.

The paper surveys model-based reinforcement learning methods that aim to reduce sample complexity by improving model accuracy, highlighting successes in robotics and games contexts.

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems from game playing and robotics have been solved with deep model-free methods. Unfortunately, the sample complexity of model-free methods is often high. To reduce the number of environment samples, model-based reinforcement learning creates an explicit model of the environment dynamics. Achieving high model accuracy is a challenge in high-dimensional problems. In recent years, a diverse landscape of model-based methods has been introduced to improve model accuracy, using methods such as uncertainty modeling, model-predictive control, latent models, and end-to-end learning and planning. Some of these methods succeed in achieving high accuracy at low sample complexity, most do so either in a robotics or in a games context. In this paper, we survey these methods; we explain in detail how they work and what their strengths and weaknesses are. We conclude with a research agenda for future work to make the methods more robust and more widely applicable to other applications.

View on arXiv PDF

Similar