Control-Tutored Reinforcement Learning: an application to the Herding Problem
This work addresses the problem of safe and efficient reinforcement learning in continuous environments for robotics or control systems, but it appears incremental as it builds on existing Q-learning and control methods.
The authors tackled the challenge of developing model-based and safe reinforcement learning for continuous state spaces by introducing a control-tutored Q-learning approach, which they validated on a multi-agent herding control problem.
In this extended abstract we introduce a novel control-tutored Q-learning approach (CTQL) as part of the ongoing effort in developing model-based and safe RL for continuous state spaces. We validate our approach by applying it to a challenging multi-agent herding control problem.