ROHCMay 12, 2020

Towards Transparency of TD-RL Robotic Systems with a Human Teacher

arXiv:2005.05926v13 citations
AI Analysis

This addresses transparency issues in human-robot collaboration for users interacting with learning robots, but it is incremental as it builds on existing RL and emotional modeling approaches.

The authors tackled the problem of non-transparent robot behavior during reinforcement learning in human-robot interaction by proposing an emotional model based on Temporal Difference error to improve transparency. The results showed that displaying internal states through emotional responses made the robot transparent to human teachers, with users preferring responsive robots due to familiarity with emotional cues.

The high request for autonomous and flexible HRI implies the necessity of deploying Machine Learning (ML) mechanisms in the robot control. Indeed, the use of ML techniques, such as Reinforcement Learning (RL), makes the robot behaviour, during the learning process, not transparent to the observing user. In this work, we proposed an emotional model to improve the transparency in RL tasks for human-robot collaborative scenarios. The architecture we propose supports the RL algorithm with an emotional model able to both receive human feedback and exhibit emotional responses based on the learning process. The model is entirely based on the Temporal Difference (TD) error. The architecture was tested in an isolated laboratory with a simple setup. The results highlight that showing its internal state through an emotional response is enough to make a robot transparent to its human teacher. People also prefer to interact with a responsive robot because they are used to understand their intentions via emotions and social signals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes