ROAILGJan 22, 2024

Efficient and Generalized end-to-end Autonomous Driving System with Latent Deep Reinforcement Learning and Demonstrations

arXiv:2401.11792v810 citationsh-index: 7ECML/PKDD
Originality Incremental advance
AI Analysis

This work addresses the problem of developing efficient and safe autonomous driving systems for complex urban environments, representing an incremental improvement over existing methods.

The paper tackles the challenges of high sample complexity, poor generalization, and low safety in autonomous driving systems by introducing EGADS, which combines latent deep reinforcement learning with expert demonstrations. Experimental results show that EGADS significantly reduces sample complexity, greatly improves safety performance, and exhibits strong generalization in complex urban scenarios.

An intelligent driving system should dynamically formulate appropriate driving strategies based on the current environment and vehicle status while ensuring system security and reliability. However, methods based on reinforcement learning and imitation learning often suffer from high sample complexity, poor generalization, and low safety. To address these challenges, this paper introduces an efficient and generalized end-to-end autonomous driving system (EGADS) for complex and varied scenarios. The RL agent in our EGADS combines variational inference with normalizing flows, which are independent of distribution assumptions. This combination allows the agent to capture historical information relevant to driving in latent space effectively, thereby significantly reducing sample complexity. Additionally, we enhance safety by formulating robust safety constraints and improve generalization and performance by integrating RL with expert demonstrations. Experimental results demonstrate that, compared to existing methods, EGADS significantly reduces sample complexity, greatly improves safety performance, and exhibits strong generalization capabilities in complex urban scenarios. Particularly, we contributed an expert dataset collected through human expert steering wheel control, specifically using the G29 steering wheel.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes