CLAIHCJun 27, 2024

Captioning Visualizations with Large Language Models (CVLLM): A Tutorial

arXiv:2406.19512v1
Originality Synthesis-oriented
AI Analysis

This is an incremental tutorial for researchers and practitioners in information visualization, summarizing existing work without introducing novel advancements.

The paper reviews the application of large language models (LLMs) to automatically caption visualizations, discussing existing methods and future directions without presenting new experimental results or specific performance metrics.

Automatically captioning visualizations is not new, but recent advances in large language models(LLMs) open exciting new possibilities. In this tutorial, after providing a brief review of Information Visualization (InfoVis) principles and past work in captioning, we introduce neural models and the transformer architecture used in generic LLMs. We then discuss their recent applications in InfoVis, with a focus on captioning. Additionally, we explore promising future directions in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes