CVAILGROFeb 25, 2022

6D Rotation Representation For Unconstrained Head Pose Estimation

arXiv:2202.12555v2137 citationsHas Code
Originality Highly original
AI Analysis

This work addresses head pose estimation for applications like human-computer interaction, offering a significant performance improvement over existing methods.

The paper tackles ambiguous rotation labels in head pose estimation by introducing a continuous 6D rotation matrix representation and a geodesic distance-based loss, resulting in a method that outperforms state-of-the-art approaches by up to 20% on AFLW2000 and BIWI datasets.

In this paper, we present a method for unconstrained end-to-end head pose estimation. We address the problem of ambiguous rotation labels by introducing the rotation matrix formalism for our ground truth data and propose a continuous 6D rotation matrix representation for efficient and robust direct regression. This way, our method can learn the full rotation appearance which is contrary to previous approaches that restrict the pose prediction to a narrow-angle for satisfactory results. In addition, we propose a geodesic distance-based loss to penalize our network with respect to the SO(3) manifold geometry. Experiments on the public AFLW2000 and BIWI datasets demonstrate that our proposed method significantly outperforms other state-of-the-art methods by up to 20\%. We open-source our training and testing code along with our pre-trained models: https://github.com/thohemp/6DRepNet.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes