ROAug 3, 2020

Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning

arXiv:2008.00715v28 citationsHas Code
AI Analysis

This provides a reproducible and accessible benchmark for researchers in reinforcement learning, though it is incremental as it builds on existing RL methods applied to a new setup.

The authors tackled the problem of creating a low-cost benchmark for real-world reinforcement learning by introducing Learning to Drive (L2D), where RL agents learn to drive a Donkey car from disengagements, and they showed that existing algorithms can learn to drive from scratch in less than five minutes and outperform imitation learning and human operators.

We present Learning to Drive (L2D), a low-cost benchmark for real-world reinforcement learning (RL). L2D involves a simple and reproducible experimental setup where an RL agent has to learn to drive a Donkey car around three miniature tracks, given only monocular image observations and speed of the car. The agent has to learn to drive from disengagements, which occurs when it drives off the track. We present and open-source our training pipeline, which makes it straightforward to apply any existing RL algorithm to the task of autonomous driving with a Donkey car. We test imitation learning, state-of-the-art model-free, and model-based algorithms on the proposed L2D benchmark. Our results show that existing RL algorithms can learn to drive the car from scratch in less than five minutes of interaction. We demonstrate that RL algorithms can learn from sparse and noisy disengagement to drive even faster than imitation learning and a human operator.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes