CVJan 20, 2025

StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer

arXiv:2501.11319v217 citationsh-index: 1Has CodeCVPR
AI Analysis

This work addresses style transfer for users needing efficient, training-free methods, but it is incremental as it builds on existing diffusion-based approaches.

The paper tackles the problem of layout changes and content leakage in training-free diffusion-based style transfer by proposing StyleSSP, which enhances the sampling startpoint through frequency manipulation and negative guidance, resulting in improved content preservation and reduced leakage compared to previous baselines.

Training-free diffusion-based methods have achieved remarkable success in style transfer, eliminating the need for extensive training or fine-tuning. However, due to the lack of targeted training for style information extraction and constraints on the content image layout, training-free methods often suffer from layout changes of original content and content leakage from style images. Through a series of experiments, we discovered that an effective startpoint in the sampling stage significantly enhances the style transfer process. Based on this discovery, we propose StyleSSP, which focuses on obtaining a better startpoint to address layout changes of original content and content leakage from style image. StyleSSP comprises two key components: (1) Frequency Manipulation: To improve content preservation, we reduce the low-frequency components of the DDIM latent, allowing the sampling stage to pay more attention to the layout of content images; and (2) Negative Guidance via Inversion: To mitigate the content leakage from style image, we employ negative guidance in the inversion stage to ensure that the startpoint of the sampling stage is distanced from the content of style image. Experiments show that StyleSSP surpasses previous training-free style transfer baselines, particularly in preserving original content and minimizing the content leakage from style image. Project page: https://github.com/bytedance/StyleSSP.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes