Learning Collision-free and Torque-limited Robot Trajectories based on Alternative Safe Behaviors
This work addresses safety-critical motion planning for robots, offering a solution for real-time trajectory generation that ensures compliance with physical constraints, though it is incremental in combining existing techniques like reinforcement learning and physics simulation.
The paper tackles the problem of generating safe robot trajectories by learning to produce collision-free and torque-limited motions online, using a neural network trained with reinforcement learning. The method reliably prevents collisions and respects torque and joint limits, as demonstrated in simulations with humanoid and industrial robots and real-time experiments on a real robot.
This paper presents an approach for learning online generation of collision-free and torque-limited robot trajectories. In order to generate future motions, a neural network is periodically invoked. Based on the current kinematic state of the robot and the network output, a trajectory for the current time interval can be calculated. The main idea of our paper is to execute the computed motion only if a collision-free and torque-limited way to continue the trajectory is known. In practice, the motion computed for the current time interval is extended by a braking trajectory and simulated using a physics engine. If the simulated trajectory complies with all safety constraints, the computed motion is carried out. Otherwise, the braking trajectory calculated in the previous time interval serves as an alternative safe behavior. Given a task-specific reward function, the neural network is trained using reinforcement learning. The design of the action space used for reinforcement learning ensures that all computed trajectories comply with kinematic joint limits. For our evaluation, simulated humanoid robots and industrial robots are trained to reach as many randomly placed target points as possible. We show that our method reliably prevents collisions with static obstacles and collisions between the robot arms, while generating motions that respect both torque limits and kinematic joint limits. Experiments with a real robot demonstrate that safe trajectories can be generated in real-time.