RO AI CV HC LGMar 21, 2025

HAPI: A Model for Learning Robot Facial Expressions from Human Preferences

Dongsheng Yang, Qianying Liu, Wataru Sato, Takashi Minato, Chaoran Liu, Shin'ya Nishida

arXiv:2503.17046v23.22 citationsh-index: 4Has CodeIROS

Originality Incremental advance

AI Analysis

This work addresses the challenge of creating natural human-robot interactions through improved facial expressions, representing an incremental advance in automated robotic expression generation.

The paper tackled the problem of generating realistic robotic facial expressions by proposing a learning-to-rank framework that uses human feedback to bridge the gap between human preferences and model predictions, resulting in significantly more realistic and socially resonant expressions for Anger, Happiness, and Surprise on a 35-DOF android platform compared to baselines.

Automatic robotic facial expression generation is crucial for human-robot interaction, as handcrafted methods based on fixed joint configurations often yield rigid and unnatural behaviors. Although recent automated techniques reduce the need for manual tuning, they tend to fall short by not adequately bridging the gap between human preferences and model predictions-resulting in a deficiency of nuanced and realistic expressions due to limited degrees of freedom and insufficient perceptual integration. In this work, we propose a novel learning-to-rank framework that leverages human feedback to address this discrepancy and enhanced the expressiveness of robotic faces. Specifically, we conduct pairwise comparison annotations to collect human preference data and develop the Human Affective Pairwise Impressions (HAPI) model, a Siamese RankNet-based approach that refines expression evaluation. Results obtained via Bayesian Optimization and online expression survey on a 35-DOF android platform demonstrate that our approach produces significantly more realistic and socially resonant expressions of Anger, Happiness, and Surprise than those generated by baseline and expert-designed methods. This confirms that our framework effectively bridges the gap between human preferences and model predictions while robustly aligning robotic expression generation with human affective responses.

View on arXiv PDF Code

Similar