CV · Nov 30, 2025

Charts Are Not Images: On the Challenges of Scientific Chart Editing

Shawn Li, Ryan Rossi, Sungchul Kim, Sunav Choudhary, Franck Dernoncourt, Puneet Mathur, Zhengzhong Tu, Yue Zhao

arXiv:2512.00752v1 · 14.44 citations · h-index: 13 · Has Code

Originality: Synthesis-oriented

AI Analysis

This paper addresses automated scientific chart editing, a problem relevant to researchers and data scientists, and provides a benchmark for evaluating structure-aware models. Its contribution is incremental: a benchmark rather than a new method.

The paper examines the application of generative models to scientific chart editing and shows that treating charts as arrangements of pixels fails to account for their structured nature. It introduces FigEdit, a benchmark with over 30,000 samples across 10 chart types and five tasks, on which state-of-the-art models perform poorly at structured transformations.

Generative models, such as diffusion and autoregressive approaches, have demonstrated impressive capabilities in editing natural images. However, applying these tools to scientific charts rests on a flawed assumption: a chart is not merely an arrangement of pixels but a visual representation of structured data governed by a graphical grammar. Consequently, chart editing is not a pixel-manipulation task but a structured transformation problem. To address this fundamental mismatch, we introduce FigEdit, a large-scale benchmark for scientific figure editing comprising over 30,000 samples. Grounded in real-world data, our benchmark is distinguished by its diversity, covering 10 distinct chart types and a rich vocabulary of complex editing instructions. The benchmark is organized into five distinct and progressively challenging tasks: single edits, multi edits, conversational edits, visual-guidance-based edits, and style transfer. Our evaluation of a range of state-of-the-art models on this benchmark reveals their poor performance on scientific figures, as they consistently fail to handle the underlying structured transformations required for valid edits. Furthermore, our analysis indicates that traditional evaluation metrics (e.g., SSIM, PSNR) have limitations in capturing the semantic correctness of chart edits. Our benchmark demonstrates the profound limitations of pixel-level manipulation and provides a robust foundation for developing and evaluating future structure-aware models. By releasing FigEdit (https://github.com/adobe-research/figure-editing), we aim to enable systematic progress in structure-aware figure editing, provide a common ground for fair comparison, and encourage future research on models that understand both the visual and semantic layers of scientific charts.
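The abstract's point about pixel-level metrics can be illustrated with a toy sketch. The code below is not FigEdit's evaluation code: the rasterizer `render_bars`, the bar heights, and the layout parameters are all hypothetical, chosen only to show how PSNR can rank an unedited chart above a semantically correct edit that happens to use a different bar width.

```python
import math

def render_bars(heights, bar_width, gap, canvas_height=10):
    """Rasterize bar heights into a binary grid (1 = bar pixel, 0 = background).

    Hypothetical toy renderer; each bar occupies one slot of bar_width + gap columns.
    """
    width = len(heights) * (bar_width + gap)
    grid = [[0] * width for _ in range(canvas_height)]
    for i, h in enumerate(heights):
        x0 = i * (bar_width + gap)
        for y in range(canvas_height - h, canvas_height):
            for x in range(x0, x0 + bar_width):
                grid[y][x] = 1
    return grid

def psnr(a, b, peak=1.0):
    """Peak signal-to-noise ratio between two equally sized grids."""
    n = len(a) * len(a[0])
    mse = sum((pa - pb) ** 2
              for row_a, row_b in zip(a, b)
              for pa, pb in zip(row_a, row_b)) / n
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)

# Assumed instruction: "change the second bar from 5 to 6".
original_data = [3, 5, 2]
target_data = [3, 6, 2]

target_img = render_bars(target_data, bar_width=4, gap=2)

# Candidate A: the edit was ignored, so the data is wrong but the pixels are close.
unedited_img = render_bars(original_data, bar_width=4, gap=2)

# Candidate B: the data is correct, but the bars are drawn thinner.
restyled_data = [3, 6, 2]
restyled_img = render_bars(restyled_data, bar_width=2, gap=4)

psnr_unedited = psnr(target_img, unedited_img)
psnr_restyled = psnr(target_img, restyled_img)

# A structure-level check compares the underlying data instead of pixels.
unedited_correct = original_data == target_data
restyled_correct = restyled_data == target_data

print(f"PSNR unedited: {psnr_unedited:.2f} dB, data correct: {unedited_correct}")
print(f"PSNR restyled: {psnr_restyled:.2f} dB, data correct: {restyled_correct}")
```

In this toy setup the unedited chart scores the higher PSNR even though its data is wrong, while the restyled chart with the correct data scores lower. Comparing the underlying values directly gives the opposite, and semantically correct, ranking.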
