ROAIFeb 5, 2021

An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios

arXiv:2102.03138v13 citations
Originality Incremental advance
AI Analysis

This work provides an incremental improvement for robotic motion planning in complex, dynamic environments, benefiting industries and services that deploy intelligent robots.

This paper addresses the challenge of robotic motion planning in dense and dynamic environments, where traditional algorithms struggle with computational cost and optimal-value reinforcement learning (RL) suffers from slow speed and instability. The authors modified an advantage actor-critic algorithm to generate optimal robot speeds and directions, achieving faster and more stable convergence than optimal-value RL, along with a higher success rate and reduced processing time.

Intelligent robots provide a new insight into efficiency improvement in industrial and service scenarios to replace human labor. However, these scenarios include dense and dynamic obstacles that make motion planning of robots challenging. Traditional algorithms like A* can plan collision-free trajectories in static environment, but their performance degrades and computational cost increases steeply in dense and dynamic scenarios. Optimal-value reinforcement learning algorithms (RL) can address these problems but suffer slow speed and instability in network convergence. Network of policy gradient RL converge fast in Atari games where action is discrete and finite, but few works have been done to address problems where continuous actions and large action space are required. In this paper, we modify existing advantage actor-critic algorithm and suit it to complex motion planning, therefore optimal speeds and directions of robot are generated. Experimental results demonstrate that our algorithm converges faster and stable than optimal-value RL. It achieves higher success rate in motion planning with lesser processing time for robot to reach its goal.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes