CVGRSep 9, 2020

MU-GAN: Facial Attribute Editing based on Multi-attention Mechanism

arXiv:2009.04177v148 citations
Originality Incremental advance
AI Analysis

This work addresses facial attribute editing for image generation applications, representing an incremental improvement over existing methods.

The paper tackles facial attribute editing by proposing MU-GAN, a method that uses multi-attention mechanisms to improve attribute manipulation and preserve details, resulting in outperforming state-of-the-art methods in accuracy and image quality.

Facial attribute editing has mainly two objectives: 1) translating image from a source domain to a target one, and 2) only changing the facial regions related to a target attribute and preserving the attribute-excluding details. In this work, we propose a Multi-attention U-Net-based Generative Adversarial Network (MU-GAN). First, we replace a classic convolutional encoder-decoder with a symmetric U-Net-like structure in a generator, and then apply an additive attention mechanism to build attention-based U-Net connections for adaptively transferring encoder representations to complement a decoder with attribute-excluding detail and enhance attribute editing ability. Second, a self-attention mechanism is incorporated into convolutional layers for modeling long-range and multi-level dependencies across image regions. experimental results indicate that our method is capable of balancing attribute editing ability and details preservation ability, and can decouple the correlation among attributes. It outperforms the state-of-the-art methods in terms of attribute manipulation accuracy and image quality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes