RONov 30, 2018

PEARL: PrEference Appraisal Reinforcement Learning for Motion Planning

arXiv:1811.12651v13 citations
Originality Incremental advance
AI Analysis

This addresses motion planning for robots needing to handle multiple preferences and disturbances, but it appears incremental as it builds on existing RL and feature engineering approaches.

The paper tackles robot motion planning for balancing user preferences like goal arrival and obstacle avoidance by proposing PEARL, a reinforcement learning method that trains in restricted domains and generalizes to complex problems, achieving successful performance on tasks including navigation through 900 moving obstacles and physical quadrotor experiments.

Robot motion planning often requires finding trajectories that balance different user intents, or preferences. One of these preferences is usually arrival at the goal, while another might be obstacle avoidance. Here, we formalize these, and similar, tasks as preference balancing tasks (PBTs) on acceleration controlled robots, and propose a motion planning solution, PrEference Appraisal Reinforcement Learning (PEARL). PEARL uses reinforcement learning on a restricted training domain, combined with features engineered from user-given intents. PEARL's planner then generates trajectories in expanded domains for more complex problems. We present an adaptation for rejection of stochastic disturbances and offer in-depth analysis, including task completion conditions and behavior analysis when the conditions do not hold. PEARL is evaluated on five problems, two multi-agent obstacle avoidance tasks and three that stochastically disturb the system at run-time: 1) a multi-agent pursuit problem with 1000 pursuers, 2) robot navigation through 900 moving obstacles, which is is trained with in an environment with only 4 static obstacles, 3) aerial cargo delivery, 4) two robot rendezvous, and 5) flying inverted pendulum. Lastly, we evaluate the method on a physical quadrotor UAV robot with a suspended load influenced by a stochastic disturbance. The video, https://youtu.be/ZkFt1uY6vlw contains the experiments and visualization of the simulations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes