CLAILGJun 27, 2024

SignSpeak: Open-Source Time Series Classification for ASL Translation

arXiv:2407.12020v2Has Code
Originality Synthesis-oriented
AI Analysis

This provides a cost-effective solution for hearing and speech-impaired communities, though it appears incremental in method.

The authors tackled the problem of ASL-to-speech translation by developing a low-cost glove and dataset, achieving 92% accuracy with their best model on 36 classes.

The lack of fluency in sign language remains a barrier to seamless communication for hearing and speech-impaired communities. In this work, we propose a low-cost, real-time ASL-to-speech translation glove and an exhaustive training dataset of sign language patterns. We then benchmarked this dataset with supervised learning models, such as LSTMs, GRUs and Transformers, where our best model achieved 92% accuracy. The SignSpeak dataset has 7200 samples encompassing 36 classes (A-Z, 1-10) and aims to capture realistic signing patterns by using five low-cost flex sensors to measure finger positions at each time step at 36 Hz. Our open-source dataset, models and glove designs, provide an accurate and efficient ASL translator while maintaining cost-effectiveness, establishing a framework for future work to build on.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes