HCAIApr 8, 2025

Towards an AI-Driven Video-Based American Sign Language Dictionary: Exploring Design and Usage Experience with Learners

arXiv:2504.05857v11 citationsh-index: 8Has CodeW4A
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge for ASL learners who cannot use text queries to look up signs, though it is incremental by building on previous HCI research and existing technology.

The researchers tackled the problem of searching for unfamiliar American Sign Language signs by developing an automated video-based dictionary using state-of-the-art sign recognition, and they found that it addressed human-AI interaction challenges like recording issues and system latency in a study with twelve novice learners.

Searching for unfamiliar American Sign Language (ASL) signs is challenging for learners because, unlike spoken languages, they cannot type a text-based query to look up an unfamiliar sign. Advances in isolated sign recognition have enabled the creation of video-based dictionaries, allowing users to submit a video and receive a list of the closest matching signs. Previous HCI research using Wizard-of-Oz prototypes has explored interface designs for ASL dictionaries. Building on these studies, we incorporate their design recommendations and leverage state-of-the-art sign-recognition technology to develop an automated video-based dictionary. We also present findings from an observational study with twelve novice ASL learners who used this dictionary during video-comprehension and question-answering tasks. Our results address human-AI interaction challenges not covered in previous WoZ research, including recording and resubmitting signs, unpredictable outputs, system latency, and privacy concerns. These insights offer guidance for designing and deploying video-based ASL dictionary systems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes