CVAug 26, 2022

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model

arXiv:2208.12675v280 citationsh-index: 30
Originality Highly original
AI Analysis

This addresses the challenge of infinite possibilities and varied user expectations in content creation from sketches and strokes, enabling applications like image editing and multi-modal synthesis.

The paper tackles the problem of generating images from hand-drawings by proposing a unified diffusion-based framework that allows three-dimensional control over faithfulness, realism, and customization, achieving state-of-the-art performance in qualitative and quantitative experiments.

Generating images from hand-drawings is a crucial and fundamental task in content creation. The translation is difficult as there exist infinite possibilities and the different users usually expect different outcomes. Therefore, we propose a unified framework supporting a three-dimensional control over the image synthesis from sketches and strokes based on diffusion models. Users can not only decide the level of faithfulness to the input strokes and sketches, but also the degree of realism, as the user inputs are usually not consistent with the real images. Qualitative and quantitative experiments demonstrate that our framework achieves state-of-the-art performance while providing flexibility in generating customized images with control over shape, color, and realism. Moreover, our method unleashes applications such as editing on real images, generation with partial sketches and strokes, and multi-domain multi-modal synthesis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes