CVAIDec 16, 2022

Fast Learning of Dynamic Hand Gesture Recognition with Few-Shot Learning Models

arXiv:2212.08363v13 citationsh-index: 3
Originality Synthesis-oriented
AI Analysis

This work addresses the effort-intensive labeling problem in video-based hand gesture recognition for applications like human-computer interaction, but it is incremental as it applies an existing Few-Shot Learning method to a modified dataset.

The paper tackles dynamic hand gesture recognition by developing Few-Shot Learning models based on Relation Networks with LSTM backbones, achieving up to 88.8% accuracy for five gestures and 81.2% for ten gestures, and showing potential savings of up to 630 and 1260 observations compared to traditional deep learning.

We develop Few-Shot Learning models trained to recognize five or ten different dynamic hand gestures, respectively, which are arbitrarily interchangeable by providing the model with one, two, or five examples per hand gesture. All models were built in the Few-Shot Learning architecture of the Relation Network (RN), in which Long-Short-Term Memory cells form the backbone. The models use hand reference points extracted from RGB-video sequences of the Jester dataset which was modified to contain 190 different types of hand gestures. Result show accuracy of up to 88.8% for recognition of five and up to 81.2% for ten dynamic hand gestures. The research also sheds light on the potential effort savings of using a Few-Shot Learning approach instead of a traditional Deep Learning approach to detect dynamic hand gestures. Savings were defined as the number of additional observations required when a Deep Learning model is trained on new hand gestures instead of a Few Shot Learning model. The difference with respect to the total number of observations required to achieve approximately the same accuracy indicates potential savings of up to 630 observations for five and up to 1260 observations for ten hand gestures to be recognized. Since labeling video recordings of hand gestures implies significant effort, these savings can be considered substantial.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes