HC, GR | Jan 22, 2024

VOICE: Visual Oracle for Interaction, Conversation, and Explanation

arXiv:2304.04083 | 12 citations | h-index: 34
AI Analysis

For educators and science communicators, VOICE provides a novel framework to make complex 3D data accessible via natural language, though evaluation is limited to expert feedback without quantitative benchmarks.

VOICE connects LLM conversational abilities with interactive 3D visualization for science communication, demonstrating low-latency, high-accuracy voice-driven navigation and explanation of molecular models, with expert evaluation showing potential.

We present VOICE, a novel approach to science communication that connects the conversational capabilities of large language models (LLMs) with interactive exploratory visualization. VOICE introduces several technical contributions that drive our conversational visualization framework. Our foundation is a pack-of-bots, in which each bot performs a specific task, such as assigning tasks, extracting instructions, or generating coherent content. We employ fine-tuning and prompt engineering to tailor each bot's performance to its specific role and to respond accurately to user queries. Our interactive text-to-visualization method generates a flythrough sequence matching the content explanation. In addition, natural language interaction lets users navigate and manipulate the 3D models in real time. The VOICE framework can receive arbitrary voice commands from the user and respond verbally, tightly coupled with the corresponding visual representation, with low latency and high accuracy. We demonstrate the effectiveness of our approach by applying it to the molecular visualization domain, analyzing three 3D molecular models with multi-scale and multi-instance attributes. Finally, we evaluate VOICE with educational experts to show the potential of our approach. All supplemental materials are available at https://osf.io/g7fbr.
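
The pack-of-bots idea in the abstract can be pictured as a small dispatcher that routes each user query to a role-specific bot. The sketch below is only an illustration under strong assumptions: every name in it (`BotReply`, `assign_task`, `navigation_bot`, `explanation_bot`, `handle`, and the command strings) is hypothetical and not taken from the paper, and simple keyword matching stands in for the fine-tuned and prompt-engineered LLM bots that VOICE actually uses.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class BotReply:
    bot: str                        # which bot handled the query
    text: str                       # what would be spoken back to the user
    command: Optional[str] = None   # hypothetical viewer command, if any


def navigation_bot(query: str) -> BotReply:
    # Stand-in for an instruction-extraction bot: derive a viewer action.
    action = "zoom_in" if "zoom" in query.lower() else "rotate"
    return BotReply("navigation", f"Executing {action}.", action)


def explanation_bot(query: str) -> BotReply:
    # Stand-in for a content-generation bot; a real one would call an LLM.
    return BotReply("explanation", f"Here is an explanation for: {query}")


# Registry mapping a role name to the bot that fulfils it (assumed roles).
BOTS: Dict[str, Callable[[str], BotReply]] = {
    "navigation": navigation_bot,
    "explanation": explanation_bot,
}

NAVIGATION_WORDS = ("zoom", "rotate", "move")


def assign_task(query: str) -> str:
    # Stand-in for the task-assignment bot: keyword match instead of an LLM.
    lowered = query.lower()
    if any(word in lowered for word in NAVIGATION_WORDS):
        return "navigation"
    return "explanation"


def handle(query: str) -> BotReply:
    # Route the query to the bot chosen by the task-assignment step.
    return BOTS[assign_task(query)](query)
```

Calling `handle("Zoom into the capsid")` would route to the navigation stand-in and yield a viewer command, while a question such as `handle("What is a spike protein?")` would fall through to the explanation stand-in with no command attached. Keeping task assignment separate from the role-specific bots mirrors the division of labour the abstract describes, but how the actual system implements it is not specified here.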
