RO CVNov 30, 2025

Sign Language Recognition using Bidirectional Reservoir Computing

Nitin Kumar Singh, Arie Rachmad Syulistyo, Yuichiro Tanaka, Hakaru Tamukoh

arXiv:2512.00777v15.71 citationsh-index: 7

Originality Incremental advance

AI Analysis

This work addresses the problem of high computational demands in sign language recognition for resource-constrained devices, offering an incremental improvement in efficiency.

The paper tackled sign language recognition by proposing an efficient system using MediaPipe and bidirectional reservoir computing, achieving 57.71% accuracy and reducing training time from 55 minutes to 9 seconds compared to a deep learning method.

Sign language recognition (SLR) facilitates communication between deaf and hearing individuals. Deep learning is widely used to develop SLR-based systems; however, it is computationally intensive and requires substantial computational resources, making it unsuitable for resource-constrained devices. To address this, we propose an efficient sign language recognition system using MediaPipe and an echo state network (ESN)-based bidirectional reservoir computing (BRC) architecture. MediaPipe extracts hand joint coordinates, which serve as inputs to the ESN-based BRC architecture. The BRC processes these features in both forward and backward directions, efficiently capturing temporal dependencies. The resulting states of BRC are concatenated to form a robust representation for classification. We evaluated our method on the Word-Level American Sign Language (WLASL) video dataset, achieving a competitive accuracy of 57.71% and a significantly lower training time of only 9 seconds, in contrast to the 55 minutes and $38$ seconds required by the deep learning-based Bi-GRU approach. Consequently, the BRC-based SLR system is well-suited for edge devices.

View on arXiv PDF

Similar