ROSY, Jul 8, 2021

Reinforcement Learning based Negotiation-aware Motion Planning of Autonomous Vehicles

arXiv:2107.03600v1, 11 citations
Originality: Incremental advance
AI Analysis

This work addresses the challenge of safe and predictable motion planning for autonomous vehicles in mixed human-robot traffic, though it is incremental as it builds on existing RL and motion planning methods.

The paper tackles the problem of autonomous vehicles navigating with human traffic participants by proposing a reinforcement learning framework that adjusts driving style in real-time based on environmental changes, demonstrating improved performance in narrow lane navigation through simulation and real-world tests.

Autonomous vehicles integrating onto roadways with human traffic participants must understand and adapt to the participants' intentions and driving styles, responding in predictable ways without explicit communication. This paper proposes a reinforcement learning (RL) based negotiation-aware motion planning framework, which uses RL to adjust the driving style of the planner by dynamically modifying the motion planner's prediction horizon length in real time in response to changes in the environment, typically triggered by traffic participants with different driving styles switching intents. The framework models the interaction between the autonomous vehicle and other traffic participants as a Markov Decision Process. A temporal sequence of occupancy grid maps is taken as input to the RL module to embed implicit intention reasoning. Curriculum learning is employed to enhance the training efficiency and robustness of the algorithm. We applied our method to narrow lane navigation in both simulation and the real world to demonstrate that the proposed method outperforms the common alternative, owing to its advantage in alleviating the social dilemma problem through proper negotiation skills.
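As a rough illustration of the interface the abstract describes, the sketch below stacks recent occupancy grids into an observation and maps a discrete RL action to a prediction horizon for the planner. It is a minimal sketch under stated assumptions, not the authors' implementation: the names `GridHistory`, `select_horizon`, and `HORIZONS_S`, the three candidate horizon values, and the greedy value-based selection are all hypothetical, since the abstract specifies neither the action set nor the RL algorithm.

```python
from collections import deque
from typing import Deque, List

Grid = List[List[int]]  # 1 = occupied cell, 0 = free cell

# Hypothetical discrete action set: candidate prediction horizons in seconds.
# The abstract does not give the actual values or their number.
HORIZONS_S = (1.0, 2.0, 4.0)


class GridHistory:
    """Keeps the most recent `length` occupancy grids as the RL observation."""

    def __init__(self, length: int, rows: int, cols: int) -> None:
        # Start with all-free grids so the observation always has `length` frames.
        empty_frames = ([[0] * cols for _ in range(rows)] for _ in range(length))
        self.frames: Deque[Grid] = deque(empty_frames, maxlen=length)

    def push(self, grid: Grid) -> None:
        # Appending to a full deque drops the oldest frame automatically.
        self.frames.append(grid)

    def observation(self) -> List[Grid]:
        # Frames are ordered oldest first, newest last.
        return list(self.frames)


def select_horizon(action_values: List[float]) -> float:
    """Greedy choice: return the horizon whose action has the highest value."""
    best = max(range(len(action_values)), key=action_values.__getitem__)
    return HORIZONS_S[best]
```

In a full system, a learned policy would produce the action values from the stacked grids, and the chosen horizon would be passed to the motion planner at each replanning step. Both of those components are outside the scope of this sketch.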
