CVHCMay 18

Low Latency Gaze Tracking via Latent Optical Sensing

arXiv:2605.1799010.5
Predicted impact top 73% in CV · last 90 daysOriginality Highly original
AI Analysis

This work addresses the need for ultra-low-latency gaze tracking in human-computer interaction by eliminating image readout and computation bottlenecks.

The authors developed a gaze tracking system using a passive optical encoder that directly acquires latent features, achieving 3.4 ms end-to-end latency while maintaining competitive accuracy, outperforming conventional camera-based methods.

We present a real-time gaze tracking system that directly acquires task-relevant latent features using a fully passive optical encoder. Instead of forming and processing full-resolution images, our approach leverages a microlens array with a co-designed binary chromium mask to perform spatially multiplexed optical encoding, producing a compact set of measurements sufficient for gaze estimation. By integrating sensing and feature extraction in the optical domain, the proposed system eliminates the need for high-bandwidth image readout and substantially reduces computational overhead. The encoded measurements are captured by a 4 x 4 phototransistor array and mapped to gaze direction using a lightweight neural network. Our proof-of-concept prototype enables an end-to-end sensing-to-inference latency of 3.4 ms, outperforming published research systems. We demonstrate the effectiveness of our approach on both simulated and real-world data, achieving competitive gaze estimation accuracy while significantly improving latency and energy efficiency compared to conventional camera-based pipelines. This work highlights the potential of task-driven optical sensing for ultra-low-latency, computationally efficient human-computer interaction systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes