CV AIMay 30, 2023

DiffSketching: Sketch Control Image Synthesis with Diffusion Models

Qiang Wang, Di Kong, Fengyin Lin, Yonggang Qi

arXiv:2305.18812v110.426 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of sketch-to-image synthesis for creative users, offering an incremental improvement over existing methods.

The paper tackles the challenge of translating abstract sketches into realistic images by using diffusion models with cross-domain constraints and classifier guidance, achieving higher generation quality and human evaluation scores than GAN-based methods without needing large sketch-image datasets.

Creative sketch is a universal way of visual expression, but translating images from an abstract sketch is very challenging. Traditionally, creating a deep learning model for sketch-to-image synthesis needs to overcome the distorted input sketch without visual details, and requires to collect large-scale sketch-image datasets. We first study this task by using diffusion models. Our model matches sketches through the cross domain constraints, and uses a classifier to guide the image synthesis more accurately. Extensive experiments confirmed that our method can not only be faithful to user's input sketches, but also maintain the diversity and imagination of synthetic image results. Our model can beat GAN-based method in terms of generation quality and human evaluation, and does not rely on massive sketch-image datasets. Additionally, we present applications of our method in image editing and interpolation.

View on arXiv PDF Code

Similar