CVAIHCNov 13, 2021

UET-Headpose: A sensor-based top-view head pose dataset

arXiv:2111.07039v13 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of expensive and complex head pose data collection for applications like human-robot interaction, though it is incremental as it builds on existing datasets and methods.

The authors tackled the challenge of head pose estimation by introducing the UET-Headpose dataset, which uses an absolute orientation sensor for low-cost, easy setup top-view data collection, and they developed the FSANet-Wide model that significantly outperforms on top-view images while being lightweight.

Head pose estimation is a challenging task that aims to solve problems related to predicting three dimensions vector, that serves for many applications in human-robot interaction or customer behavior. Previous researches have proposed some precise methods for collecting head pose data. But those methods require either expensive devices like depth cameras or complex laboratory environment setup. In this research, we introduce a new approach with efficient cost and easy setup to collecting head pose images, namely UET-Headpose dataset, with top-view head pose data. This method uses an absolute orientation sensor instead of Depth cameras to be set up quickly and small cost but still ensure good results. Through experiments, our dataset has been shown the difference between its distribution and available dataset like CMU Panoptic Dataset \cite{CMU}. Besides using the UET-Headpose dataset and other head pose datasets, we also introduce the full-range model called FSANet-Wide, which significantly outperforms head pose estimation results by the UET-Headpose dataset, especially on top-view images. Also, this model is very lightweight and takes small size images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes