ROApr 10

Perception Is All You Need: A Neuroscience Framework for Low Cost Sensorless Gaze in HRI

arXiv:2604.098298.2h-index: 6
Predicted impact top 66% in RO · last 90 daysOriginality Incremental advance
AI Analysis

For researchers and practitioners in human-robot interaction, this work provides a low-cost, privacy-preserving alternative to expensive gaze-following systems, potentially democratizing access to gaze-based interaction in educational and therapeutic settings.

This paper proposes a sensorless, power-free gaze-following framework for child-robot interaction that exploits the human visual system's convexity assumption, enabling sub-dollar cardboard robots to produce perceived mutual gaze without any electronics. The framework is grounded in neuroscience evidence and offers a path to deliver two decades of HRI gaze findings at population scale.

Gaze-following in child-robot interaction improves attention, recall, and learning, but requires expensive platforms (\$30,000+), sensors, algorithms, and raises privacy concerns. We propose a framework that avoids sensors and computation entirely, instead relying on the human visual system's assumption of convexity to produce perceptual gaze-following between a robot and its viewer. Specifically, we motivate sub-dollar cardboard robot design that directly implements the brain's own gaze computation pipeline in reverse, making the viewer's perceptual system the robot's "actuator", with no sensors, no power, and no privacy concerns. We ground this framework in three converging lines of theoretical and empirical neuroscience evidence. Namely, the distributed face processing network that computes gaze direction via the superior temporal sulcus, the high-precision convexity prior that causes the brain to perceive concave faces as convex, and the predictive processing hierarchy in which top-down face knowledge overrides bottom-up depth signals. These mechanisms explain why a concave eye socket with a painted pupil produces the perception of mutual gaze from any viewing angle. We derive design constraints from perceptual science, present a sub-dollar open-template robot with parameterized interchangeable eye inserts, and identify boundary conditions (developmental, clinical, and geometric) that predict where the framework will succeed and where it will fail. If leveraged, two decades of HRI gaze findings become deliverable at population scale.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes