ROCVHCJun 2

Face versus Body Tracking for Human-Robot Interaction: An Egocentric Dataset

DeepMind
arXiv:2606.0369461.1h-index: 33
Predicted impact top 34% in RO · last 90 daysOriginality Incremental advance
AI Analysis

For social robot developers, this work identifies critical gaps in standard tracking benchmarks and provides an optimized pipeline to reduce interaction breakdowns in human-robot interaction.

The authors created an egocentric dataset from a social robot to evaluate face vs. body tracking, showing that integrating re-identification reduces identity switches by 49% but causes facial tracking to spike due to profile sensitivity.

To enable meaningful human-robot interaction (HRI), a robot must continuously assess engagement by consistently tracking users over time. State-of-the-art computer vision models, however, are heavily optimized for surveillance or autonomous driving. A social robot faces distinct egocentric challenges, such as humans bouncing, obstructing each other, or leaving the frame. Frequent identity switches (IDSW) cause the robot to lose its footing mid-conversation. To address this, we introduce a novel, custom-annotated egocentric dataset collected via the Furhat robot to capture complex social dynamics. We present a systematic evaluation isolating detection errors from tracking logic, comparing face versus body tracking, and assessing the impact of extended spatial memory and appearance re-identification (ReID). Results indicate that increasing spatial memory mitigates prolonged occlusions but fails on complex dynamic events. Integrating ReID resolves complex switches but exhibits opposing effects: it substantially improves body tracking stability, yet causes facial IDSW to spike due to profile angle sensitivity. Ultimately, our optimized pipeline reduces IDSW by 49\%, mitigating interaction breakdowns. Because standard benchmarks lack dense, close-quarter occlusions, this work highlights the critical need for natively captured social dynamics to truly validate HRI perception models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes