CVAICLDec 17, 2023

StarVector: Generating Scalable Vector Graphics Code from Images and Text

MILA
arXiv:2312.11556v469 citationsh-index: 32CVPR
Originality Highly original
AI Analysis

This work solves the problem of scalable and versatile image rendering for applications requiring vector graphics, representing a novel method for a known bottleneck in SVG generation.

The paper tackles the problem of generating Scalable Vector Graphics (SVG) code from images and text, addressing issues like lack of semantic understanding and artifacts in previous methods, and introduces StarVector, a multimodal large language model that achieves state-of-the-art performance by producing more compact and semantically rich SVGs.

Scalable Vector Graphics (SVGs) are vital for modern image rendering due to their scalability and versatility. Previous SVG generation methods have focused on curve-based vectorization, lacking semantic understanding, often producing artifacts, and struggling with SVG primitives beyond path curves. To address these issues, we introduce StarVector, a multimodal large language model for SVG generation. It performs image vectorization by understanding image semantics and using SVG primitives for compact, precise outputs. Unlike traditional methods, StarVector works directly in the SVG code space, leveraging visual understanding to apply accurate SVG primitives. To train StarVector, we create SVG-Stack, a diverse dataset of 2M samples that enables generalization across vectorization tasks and precise use of primitives like ellipses, polygons, and text. We address challenges in SVG evaluation, showing that pixel-based metrics like MSE fail to capture the unique qualities of vector graphics. We introduce SVG-Bench, a benchmark across 10 datasets, and 3 tasks: Image-to-SVG, Text-to-SVG generation, and diagram generation. Using this setup, StarVector achieves state-of-the-art performance, producing more compact and semantically rich SVGs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes