ROLGSYApr 1, 2025

Value Iteration for Learning Concurrently Executable Robotic Control Tasks

arXiv:2504.01174v1h-index: 22AAMAS
Originality Incremental advance
AI Analysis

This addresses the challenge of concurrent task execution for redundant robotic systems like multi-robot systems and manipulators, representing an incremental advancement in multi-objective RL.

The paper tackles the problem of training redundant robots to execute multiple tasks concurrently by proposing a reinforcement learning method that learns independent value functions and combines them in prioritized stacks, demonstrating the approach on several robotic systems.

Many modern robotic systems such as multi-robot systems and manipulators exhibit redundancy, a property owing to which they are capable of executing multiple tasks. This work proposes a novel method, based on the Reinforcement Learning (RL) paradigm, to train redundant robots to be able to execute multiple tasks concurrently. Our approach differs from typical multi-objective RL methods insofar as the learned tasks can be combined and executed in possibly time-varying prioritized stacks. We do so by first defining a notion of task independence between learned value functions. We then use our definition of task independence to propose a cost functional that encourages a policy, based on an approximated value function, to accomplish its control objective while minimally interfering with the execution of higher priority tasks. This allows us to train a set of control policies that can be executed simultaneously. We also introduce a version of fitted value iteration to learn to approximate our proposed cost functional efficiently. We demonstrate our approach on several scenarios and robotic systems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes