CVJan 21, 2019

Hybrid coarse-fine classification for head pose estimation

arXiv:1901.06778v220 citations
Originality Incremental advance
AI Analysis

This work addresses a key limitation in head pose estimation for applications like gaze estimation and face alignment, offering an incremental improvement over traditional methods.

The paper tackles head pose estimation without relying on facial landmarks by combining coarse and fine regression outputs in a deep network, achieving state-of-the-art results on AFLW2000 and BIWI benchmarks.

Head pose estimation, which computes the intrinsic Euler angles (yaw, pitch, roll) from the human, is crucial for gaze estimation, face alignment, and 3D reconstruction. Traditional approaches heavily relies on the accuracy of facial landmarks. It limits their performances, especially when the visibility of the face is not in good condition. In this paper, to do the estimation without facial landmarks, we combine the coarse and fine regression output together for a deep network. Utilizing more quantization units for the angles, a fine classifier is trained with the help of other auxiliary coarse units. Integrating regression is adopted to get the final prediction. The proposed approach is evaluated on three challenging benchmarks. It achieves the state-of-the-art on AFLW2000, BIWI and performs favorably on AFLW. The code has been released on Github.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes