CVIVDec 3, 2024

GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations

arXiv:2412.02214v11 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the issue of photorealism in style transfer for image synthesis applications, though it appears incremental as it builds on existing geometric representations.

The paper tackles the problem of artifacts and loss of photorealism in style transfer by proposing GIST, a method that uses multiscale geometric representations to preserve details without training, achieving on-par or better performance than recent approaches while significantly reducing processing time.

State-of-the-art Style Transfer methods often leverage pre-trained encoders optimized for discriminative tasks, which may not be ideal for image synthesis. This can result in significant artifacts and loss of photorealism. Motivated by the ability of multiscale geometric image representations to capture fine-grained details and global structure, we propose GIST: Geometric-based Image Style Transfer, a novel Style Transfer technique that exploits the geometric properties of content and style images. GIST replaces the standard Neural Style Transfer autoencoding framework with a multiscale image expansion, preserving scene details without the need for post-processing or training. Our method matches multiresolution and multidirectional representations such as Wavelets and Contourlets by solving an optimal transport problem, leading to an efficient texture transferring. Experiments show that GIST is on-par or outperforms recent photorealistic Style Transfer approaches while significantly reducing the processing time with no model training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes