AIAug 2, 2025

SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches

arXiv:2508.01237v12 citationsh-index: 26Has CodeIJCAI
Originality Incremental advance
AI Analysis

This addresses the labor-intensive manual task of diagram generation for applications in design, education, and engineering, representing a significant step toward bridging sketching and machine-readable diagrams.

The paper tackles the problem of translating ambiguous hand-drawn sketches into structured, machine-readable diagrams by introducing SketchAgent, a multi-agent system that integrates sketch recognition and symbolic reasoning, achieving this with a new benchmark dataset of over 6,000 annotated examples.

Hand-drawn sketches are a natural and efficient medium for capturing and conveying ideas. Despite significant advancements in controllable natural image generation, translating freehand sketches into structured, machine-readable diagrams remains a labor-intensive and predominantly manual task. The primary challenge stems from the inherent ambiguity of sketches, which lack the structural constraints and semantic precision required for automated diagram generation. To address this challenge, we introduce SketchAgent, a multi-agent system designed to automate the transformation of hand-drawn sketches into structured diagrams. SketchAgent integrates sketch recognition, symbolic reasoning, and iterative validation to produce semantically coherent and structurally accurate diagrams, significantly reducing the need for manual effort. To evaluate the effectiveness of our approach, we propose the Sketch2Diagram Benchmark, a comprehensive dataset and evaluation framework encompassing eight diverse diagram categories, such as flowcharts, directed graphs, and model architectures. The dataset comprises over 6,000 high-quality examples with token-level annotations, standardized preprocessing, and rigorous quality control. By streamlining the diagram generation process, SketchAgent holds great promise for applications in design, education, and engineering, while offering a significant step toward bridging the gap between intuitive sketching and machine-readable diagram generation. The benchmark is released at https://huggingface.co/datasets/DiagramAgent/Sketch2Diagram-Benchmark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes