AILGMay 20, 2022

A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning

arXiv:2205.09967v1h-index: 6
Originality Incremental advance
AI Analysis

This work addresses the limitation of single-goal path planning in robotics or AI systems, offering a more flexible approach, though it is incremental as it builds on existing goal-conditioned RL methods.

The authors tackled the problem of path planning for agents to reach multiple, previously unseen goals by proposing a goal-conditioned reinforcement learning framework with bi-directional memory editing and reward shaping, resulting in the agent successfully performing difficult missions like round trips and using shorter routes.

The aim of path planning is to reach the goal from starting point by searching for the route of an agent. In the path planning, the routes may vary depending on the number of variables such that it is important for the agent to reach various goals. Numerous studies, however, have dealt with a single goal that is predefined by the user. In the present study, I propose a novel reinforcement learning framework for a fully controllable agent in the path planning. To do this, I propose a bi-directional memory editing to obtain various bi-directional trajectories of the agent, in which the behavior of the agent and sub-goals are trained on the goal-conditioned RL. As for moving the agent in various directions, I utilize the sub-goals dedicated network, separated from a policy network. Lastly, I present the reward shaping to shorten the number of steps for the agent to reach the goal. In the experimental result, the agent was able to reach the various goals that have never been visited by the agent in the training. We confirmed that the agent could perform difficult missions such as a round trip and the agent used the shorter route with the reward shaping.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes