AIDec 1, 2025

Flowchart2Mermaid: A Vision-Language Model Powered System for Converting Flowcharts into Editable Diagram Code

arXiv:2512.02170v13 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the issue of non-editable flowcharts for users needing reusable diagrams, though it is incremental as it builds on existing image-to-diagram tools.

The authors tackled the problem of converting static flowchart images into editable code by developing Flowchart2Mermaid, a web system that uses vision-language models to generate Mermaid.js markup, achieving structured and synchronized outputs.

Flowcharts are common tools for communicating processes but are often shared as static images that cannot be easily edited or reused. We present \textsc{Flowchart2Mermaid}, a lightweight web system that converts flowchart images into editable Mermaid.js code which is a markup language for visual workflows, using a detailed system prompt and vision-language models. The interface supports mixed-initiative refinement through inline text editing, drag-and-drop node insertion, and natural-language commands interpreted by an integrated AI assistant. Unlike prior image-to-diagram tools, our approach produces a structured, version-controllable textual representation that remains synchronized with the rendered diagram. We further introduce evaluation metrics to assess structural accuracy, flow correctness, syntax validity, and completeness across multiple models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes