CVJul 26, 2018

Deep Pictorial Gaze Estimation

arXiv:1807.10002v1203 citations
Originality Incremental advance
AI Analysis

This addresses gaze estimation for applications like human-computer interaction, but it is incremental as it builds on existing deep learning approaches with a novel architectural tweak.

The paper tackles the ill-posed problem of estimating human gaze from single eye images by introducing a deep neural network that regresses to an intermediate pictorial representation, achieving higher accuracies than state-of-the-art methods and robustness to variations in gaze, head pose, and image quality.

Estimating human gaze from natural eye images only is a challenging task. Gaze direction can be defined by the pupil- and the eyeball center where the latter is unobservable in 2D images. Hence, achieving highly accurate gaze estimates is an ill-posed problem. In this paper, we introduce a novel deep neural network architecture specifically designed for the task of gaze estimation from single eye input. Instead of directly regressing two angles for the pitch and yaw of the eyeball, we regress to an intermediate pictorial representation which in turn simplifies the task of 3D gaze direction estimation. Our quantitative and qualitative results show that our approach achieves higher accuracies than the state-of-the-art and is robust to variation in gaze, head pose and image quality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes