CVLGJun 7, 2019

Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer

arXiv:1906.02913v388 citations
Originality Incremental advance
AI Analysis

This addresses the need for flexible and high-quality style transfer in artistic image generation and deployment, though it appears incremental as it builds on existing neural style transfer methods.

The paper tackles the problem of arbitrary image style transfer by introducing a model that generates high-quality stylized images in a zero-shot setting, allowing more freedom in content geometry changes, with results competitive to state-of-the-art using a single model for all styles.

This paper introduces a neural style transfer model to generate a stylized image conditioning on a set of examples describing the desired style. The proposed solution produces high-quality images even in the zero-shot setting and allows for more freedom in changes to the content geometry. This is made possible by introducing a novel Two-Stage Peer-Regularization Layer that recombines style and content in latent space by means of a custom graph convolutional layer. Contrary to the vast majority of existing solutions, our model does not depend on any pre-trained networks for computing perceptual losses and can be trained fully end-to-end thanks to a new set of cyclic losses that operate directly in latent space and not on the RGB images. An extensive ablation study confirms the usefulness of the proposed losses and of the Two-Stage Peer-Regularization Layer, with qualitative results that are competitive with respect to the current state of the art using a single model for all presented styles. This opens the door to more abstract and artistic neural image generation scenarios, along with simpler deployment of the model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes