CVApr 20, 2020

IPN Hand: A Video Dataset and Benchmark for Real-Time Continuous Hand Gesture Recognition

arXiv:2005.02134v2103 citationsHas Code
AI Analysis

This work addresses the need for a realistic benchmark dataset for continuous hand gesture recognition, which is incremental as it builds on existing datasets like nvGesture.

The authors tackled the problem of real-time continuous hand gesture recognition by introducing the IPN Hand dataset, which contains over 4,000 gesture samples and 800,000 RGB frames from 50 subjects, and they found that the state-of-the-art ResNext-101 model decreased about 30% accuracy when using this dataset, demonstrating its utility as a benchmark.

In this paper, we introduce a new benchmark dataset named IPN Hand with sufficient size, variety, and real-world elements able to train and evaluate deep neural networks. This dataset contains more than 4,000 gesture samples and 800,000 RGB frames from 50 distinct subjects. We design 13 different static and dynamic gestures focused on interaction with touchless screens. We especially consider the scenario when continuous gestures are performed without transition states, and when subjects perform natural movements with their hands as non-gesture actions. Gestures were collected from about 30 diverse scenes, with real-world variation in background and illumination. With our dataset, the performance of three 3D-CNN models is evaluated on the tasks of isolated and continuous real-time HGR. Furthermore, we analyze the possibility of increasing the recognition accuracy by adding multiple modalities derived from RGB frames, i.e., optical flow and semantic segmentation, while keeping the real-time performance of the 3D-CNN model. Our empirical study also provides a comparison with the publicly available nvGesture (NVIDIA) dataset. The experimental results show that the state-of-the-art ResNext-101 model decreases about 30% accuracy when using our real-world dataset, demonstrating that the IPN Hand dataset can be used as a benchmark, and may help the community to step forward in the continuous HGR. Our dataset and pre-trained models used in the evaluation are publicly available at https://github.com/GibranBenitez/IPN-hand.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes