ROMar 6

Sticky-Glance: Robust Intent Recognition for Human Robot Collaboration via Single-Glance

arXiv:2603.06121v1h-index: 1
AI Analysis

This addresses intent recognition for impaired people with limited motor capabilities, presenting a novel method for a known bottleneck.

The paper tackled robust gaze-based intent recognition in multi-object environments for impaired individuals, achieving a tracking rate of 0.94 for dynamic targets and selection accuracy of 0.98 for static targets, while reducing task duration by nearly 10%.

Gaze is a valuable means of communication for impaired people with extremely limited motor capabilities. However, robust gaze-based intent recognition in multi-object environments is challenging due to gaze noise, micro-saccades, viewpoint changes, and dynamic objects. To address this, we propose an object-centric gaze grounding framework that stabilizes intent through a sticky-glance algorithm, jointly modeling geometric distance and direction trends. The inferred intent remains anchored to the object even under short glances with minimal 3 gaze samples, achieving a tracking rate of 0.94 for dynamic targets and selection accuracy of 0.98 for static targets. We further introduce a continuous shared control and multi-modal interaction paradigm, enabling high-readiness control and human-in-loop feedback, thereby reducing task duration for nearly 10 \%. Experiments across dynamic tracking, multi-perspective alignment, a baseline comparison, user studies, and ablation studies demonstrate improved robustness, efficiency, and reduced workload compared to representative baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes