CVFeb 16

Controlling Your Image via Simplified Vector Graphics

arXiv:2602.14443v1h-index: 5
Originality Incremental advance
AI Analysis

This addresses the problem of fine-grained image manipulation for users in creative and editing domains, representing a new paradigm rather than an incremental improvement.

The paper tackles the challenge of element-level control in image generation by introducing a method that uses simplified vector graphics to enable intuitive modifications like adjusting shapes and colors, achieving precise control over geometry and semantics in photorealistic outputs.

Recent advances in image generation have achieved remarkable visual quality, while a fundamental challenge remains: Can image generation be controlled at the element level, enabling intuitive modifications such as adjusting shapes, altering colors, or adding and removing objects? In this work, we address this challenge by introducing layer-wise controllable generation through simplified vector graphics (VGs). Our approach first efficiently parses images into hierarchical VG representations that are semantic-aligned and structurally coherent. Building on this representation, we design a novel image synthesis framework guided by VGs, allowing users to freely modify elements and seamlessly translate these edits into photorealistic outputs. By leveraging the structural and semantic features of VGs in conjunction with noise prediction, our method provides precise control over geometry, color, and object semantics. Extensive experiments demonstrate the effectiveness of our approach in diverse applications, including image editing, object-level manipulation, and fine-grained content creation, establishing a new paradigm for controllable image generation. Project page: https://guolanqing.github.io/Vec2Pix/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes