CVLGJun 21, 2021

SHREC 2021: Track on Skeleton-based Hand Gesture Recognition in the Wild

arXiv:2106.10980v1
Originality Synthesis-oriented
AI Analysis

This work addresses the need for robust gesture recognition in applications like Mixed Reality and touchless interfaces, but it is incremental as it builds on existing benchmarks.

The paper introduced a new dataset and competition for online skeleton-based hand gesture recognition in real-world scenarios, showing that the best method achieved 0.92 F1-score, outperforming a baseline of 0.78.

Gesture recognition is a fundamental tool to enable novel interaction paradigms in a variety of application scenarios like Mixed Reality environments, touchless public kiosks, entertainment systems, and more. Recognition of hand gestures can be nowadays performed directly from the stream of hand skeletons estimated by software provided by low-cost trackers (Ultraleap) and MR headsets (Hololens, Oculus Quest) or by video processing software modules (e.g. Google Mediapipe). Despite the recent advancements in gesture and action recognition from skeletons, it is unclear how well the current state-of-the-art techniques can perform in a real-world scenario for the recognition of a wide set of heterogeneous gestures, as many benchmarks do not test online recognition and use limited dictionaries. This motivated the proposal of the SHREC 2021: Track on Skeleton-based Hand Gesture Recognition in the Wild. For this contest, we created a novel dataset with heterogeneous gestures featuring different types and duration. These gestures have to be found inside sequences in an online recognition scenario. This paper presents the result of the contest, showing the performances of the techniques proposed by four research groups on the challenging task compared with a simple baseline method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes