GRCVMay 1, 2024

TexSliders: Diffusion-Based Texture Editing in CLIP Space

arXiv:2405.00672v130 citationsh-index: 22SIGGRAPH
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in 3D content creation pipelines by enabling intuitive texture editing, though it is incremental as it adapts existing diffusion and CLIP techniques to a new domain.

The paper tackles the problem of editing textures using diffusion models, which existing methods fail at due to reliance on attention maps, by proposing a novel approach that manipulates CLIP image embeddings to condition generation, enabling arbitrary sliders with natural language prompts and no annotated data.

Generative models have enabled intuitive image creation and manipulation using natural language. In particular, diffusion models have recently shown remarkable results for natural image editing. In this work, we propose to apply diffusion techniques to edit textures, a specific class of images that are an essential part of 3D content creation pipelines. We analyze existing editing methods and show that they are not directly applicable to textures, since their common underlying approach, manipulating attention maps, is unsuitable for the texture domain. To address this, we propose a novel approach that instead manipulates CLIP image embeddings to condition the diffusion generation. We define editing directions using simple text prompts (e.g., "aged wood" to "new wood") and map these to CLIP image embedding space using a texture prior, with a sampling-based approach that gives us identity-preserving directions in CLIP space. To further improve identity preservation, we project these directions to a CLIP subspace that minimizes identity variations resulting from entangled texture attributes. Our editing pipeline facilitates the creation of arbitrary sliders using natural language prompts only, with no ground-truth annotated data necessary.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes