CVAIMay 30, 2023

DiffSketching: Sketch Control Image Synthesis with Diffusion Models

arXiv:2305.18812v126 citations
Originality Incremental advance
AI Analysis

This addresses the problem of sketch-to-image synthesis for creative users, offering an incremental improvement over existing methods.

The paper tackles the challenge of translating abstract sketches into realistic images by using diffusion models with cross-domain constraints and classifier guidance, achieving higher generation quality and human evaluation scores than GAN-based methods without needing large sketch-image datasets.

Creative sketch is a universal way of visual expression, but translating images from an abstract sketch is very challenging. Traditionally, creating a deep learning model for sketch-to-image synthesis needs to overcome the distorted input sketch without visual details, and requires to collect large-scale sketch-image datasets. We first study this task by using diffusion models. Our model matches sketches through the cross domain constraints, and uses a classifier to guide the image synthesis more accurately. Extensive experiments confirmed that our method can not only be faithful to user's input sketches, but also maintain the diversity and imagination of synthetic image results. Our model can beat GAN-based method in terms of generation quality and human evaluation, and does not rely on massive sketch-image datasets. Additionally, we present applications of our method in image editing and interpolation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes