CVNov 10, 2023

2D Image head pose estimation via latent space regression under occlusion settings

arXiv:2311.06038v121 citationsh-index: 36
Originality Incremental advance
AI Analysis

This addresses a critical limitation in computer vision for applications like human-robot interaction, though it is incremental as it builds on existing methods.

The paper tackles head pose estimation under occlusion by proposing a deep learning approach using latent space regression, achieving state-of-the-art performance on occluded datasets and similar accuracy on non-occluded ones.

Head orientation is a challenging Computer Vision problem that has been extensively researched having a wide variety of applications. However, current state-of-the-art systems still underperform in the presence of occlusions and are unreliable for many task applications in such scenarios. This work proposes a novel deep learning approach for the problem of head pose estimation under occlusions. The strategy is based on latent space regression as a fundamental key to better structure the problem for occluded scenarios. Our model surpasses several state-of-the-art methodologies for occluded HPE, and achieves similar accuracy for non-occluded scenarios. We demonstrate the usefulness of the proposed approach with: (i) two synthetically occluded versions of the BIWI and AFLW2000 datasets, (ii) real-life occlusions of the Pandora dataset, and (iii) a real-life application to human-robot interaction scenarios where face occlusions often occur. Specifically, the autonomous feeding from a robotic arm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes