CVOct 11, 2024

Bukva: Russian Sign Language Alphabet

arXiv:2410.08675v1h-index: 3Has Code
Originality Incremental advance
AI Analysis

This work provides a foundational dataset for Russian Sign Language recognition, benefiting the deaf and hard-of-hearing community by enabling better tools for communication and learning.

The paper tackles the problem of recognizing the Russian Sign Language fingerspelling alphabet by addressing data shortages with a new open-source video dataset called Bukva, achieving 83.6% top-1 accuracy using a TSM-based method for real-time inference on CPU.

This paper investigates the recognition of the Russian fingerspelling alphabet, also known as the Russian Sign Language (RSL) dactyl. Dactyl is a component of sign languages where distinct hand movements represent individual letters of a written language. This method is used to spell words without specific signs, such as proper nouns or technical terms. The alphabet learning simulator is an essential isolated dactyl recognition application. There is a notable issue of data shortage in isolated dactyl recognition: existing Russian dactyl datasets lack subject heterogeneity, contain insufficient samples, or cover only static signs. We provide Bukva, the first full-fledged open-source video dataset for RSL dactyl recognition. It contains 3,757 videos with more than 101 samples for each RSL alphabet sign, including dynamic ones. We utilized crowdsourcing platforms to increase the subject's heterogeneity, resulting in the participation of 155 deaf and hard-of-hearing experts in the dataset creation. We use a TSM (Temporal Shift Module) block to handle static and dynamic signs effectively, achieving 83.6% top-1 accuracy with a real-time inference with CPU only. The dataset, demo code, and pre-trained models are publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes