HC AIApr 8, 2025

Towards an AI-Driven Video-Based American Sign Language Dictionary: Exploring Design and Usage Experience with Learners

Saad Hassan, Matyas Bohacek, Chaelin Kim, Denise Crochet

arXiv:2504.05857v14.11 citationsh-index: 8Has CodeW4A

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge for ASL learners who cannot use text queries to look up signs, though it is incremental by building on previous HCI research and existing technology.

The researchers tackled the problem of searching for unfamiliar American Sign Language signs by developing an automated video-based dictionary using state-of-the-art sign recognition, and they found that it addressed human-AI interaction challenges like recording issues and system latency in a study with twelve novice learners.

Searching for unfamiliar American Sign Language (ASL) signs is challenging for learners because, unlike spoken languages, they cannot type a text-based query to look up an unfamiliar sign. Advances in isolated sign recognition have enabled the creation of video-based dictionaries, allowing users to submit a video and receive a list of the closest matching signs. Previous HCI research using Wizard-of-Oz prototypes has explored interface designs for ASL dictionaries. Building on these studies, we incorporate their design recommendations and leverage state-of-the-art sign-recognition technology to develop an automated video-based dictionary. We also present findings from an observational study with twelve novice ASL learners who used this dictionary during video-comprehension and question-answering tasks. Our results address human-AI interaction challenges not covered in previous WoZ research, including recording and resubmitting signs, unpredictable outputs, system latency, and privacy concerns. These insights offer guidance for designing and deploying video-based ASL dictionary systems.

View on arXiv PDF Code

Similar