CVCLLGJul 3, 2025

SciGA: A Comprehensive Dataset for Designing Graphical Abstracts in Academic Papers

arXiv:2507.02212v11 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of enhancing visual scientific communication for researchers by providing a dataset and tools to support GA design, though it is incremental as it builds on existing data and tasks.

The authors tackled the challenge of designing effective graphical abstracts (GAs) for scientific papers by introducing SciGA-145k, a dataset of 145,000 papers and 1.14 million figures, and proposed baseline models for GA recommendation tasks, achieving preliminary results with a novel metric called CAR.

Graphical Abstracts (GAs) play a crucial role in visually conveying the key findings of scientific papers. While recent research has increasingly incorporated visual materials such as Figure 1 as de facto GAs, their potential to enhance scientific communication remains largely unexplored. Moreover, designing effective GAs requires advanced visualization skills, creating a barrier to their widespread adoption. To tackle these challenges, we introduce SciGA-145k, a large-scale dataset comprising approximately 145,000 scientific papers and 1.14 million figures, explicitly designed for supporting GA selection and recommendation as well as facilitating research in automated GA generation. As a preliminary step toward GA design support, we define two tasks: 1) Intra-GA recommendation, which identifies figures within a given paper that are well-suited to serve as GAs, and 2) Inter-GA recommendation, which retrieves GAs from other papers to inspire the creation of new GAs. We provide reasonable baseline models for these tasks. Furthermore, we propose Confidence Adjusted top-1 ground truth Ratio (CAR), a novel recommendation metric that offers a fine-grained analysis of model behavior. CAR addresses limitations in traditional ranking-based metrics by considering cases where multiple figures within a paper, beyond the explicitly labeled GA, may also serve as GAs. By unifying these tasks and metrics, our SciGA-145k establishes a foundation for advancing visual scientific communication while contributing to the development of AI for Science.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes