CVSep 19, 2023

On-device Real-time Custom Hand Gesture Recognition

arXiv:2309.10858v111 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This addresses the need for customizable gesture recognition for users and developers, though it is incremental as it builds on existing embedding models and deployment tools.

The paper tackles the problem of recognizing custom, unseen hand gestures by introducing a user-friendly framework that allows users to easily collect data and deploy real-time recognition models on-device, with the entire process taking only a few minutes.

Most existing hand gesture recognition (HGR) systems are limited to a predefined set of gestures. However, users and developers often want to recognize new, unseen gestures. This is challenging due to the vast diversity of all plausible hand shapes, e.g. it is impossible for developers to include all hand gestures in a predefined list. In this paper, we present a user-friendly framework that lets users easily customize and deploy their own gesture recognition pipeline. Our framework provides a pre-trained single-hand embedding model that can be fine-tuned for custom gesture recognition. Users can perform gestures in front of a webcam to collect a small amount of images per gesture. We also offer a low-code solution to train and deploy the custom gesture recognition model. This makes it easy for users with limited ML expertise to use our framework. We further provide a no-code web front-end for users without any ML expertise. This makes it even easier to build and test the end-to-end pipeline. The resulting custom HGR is then ready to be run on-device for real-time scenarios. This can be done by calling a simple function in our open-sourced model inference API, MediaPipe Tasks. This entire process only takes a few minutes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes