CLAIDBAug 14, 2024

DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization

arXiv:2408.07401v25 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses the challenge of automating data visualization tasks for data analysts and researchers, representing an incremental advancement in domain-specific PLMs.

The authors tackled the problem of applying pre-trained language models to data visualization tasks by introducing DataVisT5, a model that outperforms state-of-the-art methods on tasks like text-to-vis and vis-to-text across public datasets.

Data visualization (DV) is the fundamental and premise tool to improve the efficiency in conveying the insights behind the big data, which has been widely accepted in existing data-driven world. Task automation in DV, such as converting natural language queries to visualizations (i.e., text-to-vis), generating explanations from visualizations (i.e., vis-to-text), answering DV-related questions in free form (i.e. FeVisQA), and explicating tabular data (i.e., table-to-text), is vital for advancing the field. Despite their potential, the application of pre-trained language models (PLMs) like T5 and BERT in DV has been limited by high costs and challenges in handling cross-modal information, leading to few studies on PLMs for DV. We introduce DataVisT5, a novel PLM tailored for DV that enhances the T5 architecture through a hybrid objective pre-training and multi-task fine-tuning strategy, integrating text and DV datasets to effectively interpret cross-modal semantics. Extensive evaluations on public datasets show that DataVisT5 consistently outperforms current state-of-the-art models on various DV-related tasks. We anticipate that DataVisT5 will not only inspire further research on vertical PLMs but also expand the range of applications for PLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes