RO LG SYApr 1, 2025

Value Iteration for Learning Concurrently Executable Robotic Control Tasks

arXiv:2504.01174v1h-index: 22AAMAS

Originality Incremental advance

AI Analysis

This addresses the challenge of concurrent task execution for redundant robotic systems like multi-robot systems and manipulators, representing an incremental advancement in multi-objective RL.

The paper tackles the problem of training redundant robots to execute multiple tasks concurrently by proposing a reinforcement learning method that learns independent value functions and combines them in prioritized stacks, demonstrating the approach on several robotic systems.

Many modern robotic systems such as multi-robot systems and manipulators exhibit redundancy, a property owing to which they are capable of executing multiple tasks. This work proposes a novel method, based on the Reinforcement Learning (RL) paradigm, to train redundant robots to be able to execute multiple tasks concurrently. Our approach differs from typical multi-objective RL methods insofar as the learned tasks can be combined and executed in possibly time-varying prioritized stacks. We do so by first defining a notion of task independence between learned value functions. We then use our definition of task independence to propose a cost functional that encourages a policy, based on an approximated value function, to accomplish its control objective while minimally interfering with the execution of higher priority tasks. This allows us to train a set of control policies that can be executed simultaneously. We also introduce a version of fitted value iteration to learn to approximate our proposed cost functional efficiently. We demonstrate our approach on several scenarios and robotic systems.

View on arXiv PDF

Similar