CVMMMay 21, 2020

Manifold Alignment for Semantically Aligned Style Transfer

arXiv:2005.10777v258 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the issue of corrupted content structure in style transfer for computer vision applications, offering a flexible method with user editing capabilities, though it is incremental in improving semantic alignment.

The paper tackles the problem of style transfer by addressing the neglect of semantic structure in existing methods, proposing a manifold alignment framework that aligns multi-manifold distributions to allow semantically similar regions to share style patterns, resulting in effective artistic and photorealistic style transfer as verified by experiments.

Most existing style transfer methods follow the assumption that styles can be represented with global statistics (e.g., Gram matrices or covariance matrices), and thus address the problem by forcing the output and style images to have similar global statistics. An alternative is the assumption of local style patterns, where algorithms are designed to swap similar local features of content and style images. However, the limitation of these existing methods is that they neglect the semantic structure of the content image which may lead to corrupted content structure in the output. In this paper, we make a new assumption that image features from the same semantic region form a manifold and an image with multiple semantic regions follows a multi-manifold distribution. Based on this assumption, the style transfer problem is formulated as aligning two multi-manifold distributions and a Manifold Alignment based Style Transfer (MAST) framework is proposed. The proposed framework allows semantically similar regions between the output and the style image share similar style patterns. Moreover, the proposed manifold alignment method is flexible to allow user editing or using semantic segmentation maps as guidance for style transfer. To allow the method to be applicable to photorealistic style transfer, we propose a new adaptive weight skip connection network structure to preserve the content details. Extensive experiments verify the effectiveness of the proposed framework for both artistic and photorealistic style transfer. Code is available at https://github.com/NJUHuoJing/MAST.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes