HCAIROFeb 15, 2025

GenComUI: Exploring Generative Visual Aids as Medium to Support Task-Oriented Human-Robot Communication

arXiv:2502.10678v17 citationsh-index: 5CHI
Originality Incremental advance
AI Analysis

This addresses the problem of complex communication scenarios in human-robot interaction for users, though it appears incremental as it builds on existing LLM and visual aid methods.

This work tackled the problem of improving human-robot task communication by developing GenComUI, a system that uses generative visual aids to support verbal communication, resulting in enhanced effectiveness as shown in a user experiment with 20 participants compared to a voice-only baseline.

This work investigates the integration of generative visual aids in human-robot task communication. We developed GenComUI, a system powered by large language models that dynamically generates contextual visual aids (such as map annotations, path indicators, and animations) to support verbal task communication and facilitate the generation of customized task programs for the robot. This system was informed by a formative study that examined how humans use external visual tools to assist verbal communication in spatial tasks. To evaluate its effectiveness, we conducted a user experiment (n = 20) comparing GenComUI with a voice-only baseline. The results demonstrate that generative visual aids, through both qualitative and quantitative analysis, enhance verbal task communication by providing continuous visual feedback, thus promoting natural and effective human-robot communication. Additionally, the study offers a set of design implications, emphasizing how dynamically generated visual aids can serve as an effective communication medium in human-robot interaction. These findings underscore the potential of generative visual aids to inform the design of more intuitive and effective human-robot communication, particularly for complex communication scenarios in human-robot interaction and LLM-based end-user development.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes