AIROMar 25, 2024

Speeding Up Path Planning via Reinforcement Learning in MCTS for Automated Parking

arXiv:2403.17234v23 citationsh-index: 1IROS
Originality Incremental advance
AI Analysis

This work addresses real-time path planning challenges for automated parking systems, representing an incremental improvement in method integration.

The paper tackles the computational expense of sampling-based path planning in high-dimensional spaces for automated parking by integrating reinforcement learning into Monte Carlo tree search, resulting in faster planning without human expert data while maintaining quality.

In this paper, we address a method that integrates reinforcement learning into the Monte Carlo tree search to boost online path planning under fully observable environments for automated parking tasks. Sampling-based planning methods under high-dimensional space can be computationally expensive and time-consuming. State evaluation methods are useful by leveraging the prior knowledge into the search steps, making the process faster in a real-time system. Given the fact that automated parking tasks are often executed under complex environments, a solid but lightweight heuristic guidance is challenging to compose in a traditional analytical way. To overcome this limitation, we propose a reinforcement learning pipeline with a Monte Carlo tree search under the path planning framework. By iteratively learning the value of a state and the best action among samples from its previous cycle's outcomes, we are able to model a value estimator and a policy generator for given states. By doing that, we build up a balancing mechanism between exploration and exploitation, speeding up the path planning process while maintaining its quality without using human expert driver data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes