HCOct 20, 2020

Calliope: Automatic Visual Data Story Generation from a Spreadsheet

arXiv:2010.09975v1184 citations
Originality Incremental advance
AI Analysis

This addresses the problem of technical barriers in data storytelling for users, but it is incremental as it builds on existing authoring tools with a new algorithmic approach.

The paper tackles the difficulty of generating visual data stories from spreadsheets by introducing Calliope, an automatic system that uses a logic-oriented Monte Carlo tree search algorithm to create and organize data facts, with evaluation showing it benefits efficient story generation.

Visual data stories shown in the form of narrative visualizations such as a poster or a data video, are frequently used in data-oriented storytelling to facilitate the understanding and memorization of the story content. Although useful, technique barriers, such as data analysis, visualization, and scripting, make the generation of a visual data story difficult. Existing authoring tools rely on users' skills and experiences, which are usually inefficient and still difficult. In this paper, we introduce a novel visual data story generating system, Calliope, which creates visual data stories from an input spreadsheet through an automatic process and facilities the easy revision of the generated story based on an online story editor. Particularly, Calliope incorporates a new logic-oriented Monte Carlo tree search algorithm that explores the data space given by the input spreadsheet to progressively generate story pieces (i.e., data facts) and organize them in a logical order. The importance of data facts is measured based on information theory, and each data fact is visualized in a chart and captioned by an automatically generated description. We evaluate the proposed technique through three example stories, two controlled experiments, and a series of interviews with 10 domain experts. Our evaluation shows that Calliope is beneficial to efficient visual data story generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes