HCNov 28, 2019

Words of Estimative Correlation: Studying Verbalizations of Scatterplots

arXiv:1911.12793v42 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of improving multimodal data analysis tools for users by providing foundational insights into verbal-visual communication, though it is incremental as it builds on existing research in visualization and natural language.

The researchers investigated how people describe scatterplots showing correlations and how they match verbal descriptions to visualizations, finding that vocabulary varies widely but certain concepts are preferred for higher correlations and some terms are ambiguous.

Natural language and visualization are being increasingly deployed together for supporting data analysis in different ways, from multimodal interaction to enriched data summaries and insights. Yet, researchers still lack systematic knowledge on how viewers verbalize their interpretations of visualizations, and how they interpret verbalizations of visualizations in such contexts. We describe two studies aimed at identifying characteristics of data and charts that are relevant in such tasks. The first study asks participants to verbalize what they see in scatterplots that depict various levels of correlations. The second study then asks participants to choose visualizations that match a given verbal description of correlation. We extract key concepts from responses, organize them in a taxonomy and analyze the categorized responses. We observe that participants use a wide range of vocabulary across all scatterplots, but particular concepts are preferred for higher levels of correlation. A comparison between the studies reveals the ambiguity of some of the concepts. We discuss how the results could inform the design of multimodal representations aligned with the data and analytical tasks, and present a research roadmap to deepen the understanding about visualizations and natural language.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes