CVHCIVMar 5, 2024

Gaze-Vector Estimation in the Dark with Temporally Encoded Event-driven Neural Networks

arXiv:2403.02909v11 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses gaze estimation for applications like human-computer interaction and driver monitoring in low-light, but it appears incremental as it builds on existing event-driven methods with a new encoding approach.

The paper tackled gaze vector prediction in extremely low-light conditions by using a temporal event encoding scheme with DVS events and grayscale frames, achieving 100% 100-pixel accuracy in predictions.

In this paper, we address the intricate challenge of gaze vector prediction, a pivotal task with applications ranging from human-computer interaction to driver monitoring systems. Our innovative approach is designed for the demanding setting of extremely low-light conditions, leveraging a novel temporal event encoding scheme, and a dedicated neural network architecture. The temporal encoding method seamlessly integrates Dynamic Vision Sensor (DVS) events with grayscale guide frames, generating consecutively encoded images for input into our neural network. This unique solution not only captures diverse gaze responses from participants within the active age group but also introduces a curated dataset tailored for low-light conditions. The encoded temporal frames paired with our network showcase impressive spatial localization and reliable gaze direction in their predictions. Achieving a remarkable 100-pixel accuracy of 100%, our research underscores the potency of our neural network to work with temporally consecutive encoded images for precise gaze vector predictions in challenging low-light videos, contributing to the advancement of gaze prediction technologies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes