CV | Nov 6, 2025

Text to Sketch Generation with Multi-Styles

arXiv:2511.04123v1 | 1 citation | h-index: 13 | Has Code
Originality: Incremental advance
AI Analysis

This work addresses the need for flexible style control in sketch generation for creative applications, representing an incremental improvement over existing methods.

The paper tackles the problem of generating sketches from text with precise style control, proposing a training-free diffusion framework that reduces content leakage and achieves high-quality, multi-style sketch synthesis.

Recent advances in vision-language models have facilitated progress in sketch generation. However, existing specialized methods primarily focus on generic synthesis and lack mechanisms for precise control over sketch styles. In this work, we propose a training-free framework based on diffusion models that enables explicit style guidance via textual prompts and referenced style sketches. Unlike previous style transfer methods that overwrite key and value matrices in self-attention, we incorporate the reference features as auxiliary information with linear smoothing and leverage a style-content guidance mechanism. This design effectively reduces content leakage from reference sketches and enhances synthesis quality, especially in cases with low structural similarity between reference and target sketches. Furthermore, we extend our framework to support controllable multi-style generation by integrating features from multiple reference sketches, coordinated via a joint AdaIN module. Extensive experiments demonstrate that our approach achieves high-quality sketch generation with accurate style alignment and improved flexibility in style control. The official implementation of M3S is available at https://github.com/CMACH508/M3S.
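The abstract mentions a joint AdaIN module for combining several reference sketches but does not spell out its form. As a rough illustration only, the sketch below shows standard adaptive instance normalization (AdaIN) on a single feature channel, followed by one plausible multi-reference variant that blends per-reference statistics with user-chosen weights. The function names (`mean_std`, `adain`, `multi_style_adain`) and the weighted-blend rule are assumptions made for this example, not the M3S implementation.

```python
import math


def mean_std(values, eps=1e-5):
    """Return the mean and epsilon-stabilised standard deviation of a list."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, math.sqrt(var + eps)


def adain(content, style_mean, style_std, eps=1e-5):
    """Standard AdaIN on one channel: normalise the content features,
    then rescale and shift them to the given style statistics."""
    c_mean, c_std = mean_std(content, eps)
    return [style_std * (v - c_mean) / c_std + style_mean for v in content]


def multi_style_adain(content, references, weights, eps=1e-5):
    """Hypothetical multi-reference variant (an assumption, not the paper's
    module): blend the per-reference mean and std with normalised weights,
    then apply AdaIN once with the blended statistics."""
    total = sum(weights)
    norm_weights = [w / total for w in weights]
    stats = [mean_std(ref, eps) for ref in references]
    blended_mean = sum(w * m for w, (m, _) in zip(norm_weights, stats))
    blended_std = sum(w * s for w, (_, s) in zip(norm_weights, stats))
    return adain(content, blended_mean, blended_std, eps)


# Toy single-channel features standing in for diffusion feature maps.
content = [0.0, 1.0, 2.0, 3.0]
ref_a = [10.0, 12.0, 14.0, 16.0]
ref_b = [-1.0, 0.0, 1.0, 2.0]

only_a = multi_style_adain(content, [ref_a, ref_b], [1.0, 0.0])
mixed = multi_style_adain(content, [ref_a, ref_b], [0.5, 0.5])
```

With weights `[1.0, 0.0]` the output takes on the mean and spread of the first reference alone; with equal weights its mean sits halfway between the two reference means. In a real pipeline the same operation would run per channel on feature tensors rather than on Python lists.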

