HCAICVDec 10, 2025

ImageTalk: Designing a Multimodal AAC Text Generation System Driven by Image Recognition and Natural Language Generation

arXiv:2512.09610v1h-index: 11
Originality Incremental advance
AI Analysis

This work addresses communication challenges for people with Motor Neuron Disease, offering a novel system with significant performance improvements.

The paper tackled the limited vocabulary and low communication rates in AAC systems for people with Motor Neuron Disease by designing ImageTalk, a multimodal text generation system that achieved 95.6% keystroke savings and high user satisfaction.

People living with Motor Neuron Disease (plwMND) frequently encounter speech and motor impairments that necessitate a reliance on augmentative and alternative communication (AAC) systems. This paper tackles the main challenge that traditional symbol-based AAC systems offer a limited vocabulary, while text entry solutions tend to exhibit low communication rates. To help plwMND articulate their needs about the system efficiently and effectively, we iteratively design and develop a novel multimodal text generation system called ImageTalk through a tailored proxy-user-based and an end-user-based design phase. The system demonstrates pronounced keystroke savings of 95.6%, coupled with consistent performance and high user satisfaction. We distill three design guidelines for AI-assisted text generation systems design and outline four user requirement levels tailored for AAC purposes, guiding future research in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes