CVDec 4, 2017

Towards Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs

arXiv:1712.00899v414 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of realistic face photo-sketch synthesis for applications in digital entertainment and law enforcement, representing an incremental improvement over existing methods.

The paper tackled the problem of generating realistic face photos or sketches from sketches or photos, respectively, by proposing a composition-aided GAN that uses facial composition information and a compositional reconstruction loss to improve structural realism and textural consistency. It achieved state-of-the-art quality, reducing the best previous Frechet Inception distance (FID) by a large margin.

Face photo-sketch synthesis aims at generating a facial sketch/photo conditioned on a given photo/sketch. It is of wide applications including digital entertainment and law enforcement. Precisely depicting face photos/sketches remains challenging due to the restrictions on structural realism and textural consistency. While existing methods achieve compelling results, they mostly yield blurred effects and great deformation over various facial components, leading to the unrealistic feeling of synthesized images. To tackle this challenge, in this work, we propose to use the facial composition information to help the synthesis of face sketch/photo. Specially, we propose a novel composition-aided generative adversarial network (CA-GAN) for face photo-sketch synthesis. In CA-GAN, we utilize paired inputs including a face photo/sketch and the corresponding pixel-wise face labels for generating a sketch/photo. In addition, to focus training on hard-generated components and delicate facial structures, we propose a compositional reconstruction loss. Finally, we use stacked CA-GANs (SCA-GAN) to further rectify defects and add compelling details. Experimental results show that our method is capable of generating both visually comfortable and identity-preserving face sketches/photos over a wide range of challenging data. Our method achieves the state-of-the-art quality, reducing best previous Frechet Inception distance (FID) by a large margin. Besides, we demonstrate that the proposed method is of considerable generalization ability. We have made our code and results publicly available: https://fei-hdu.github.io/ca-gan/.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes