CVNov 20, 2023

Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

arXiv:2311.12092v2153 citationsh-index: 9
Originality Highly original
AI Analysis

This method provides incremental improvements for users needing fine-grained control in image generation, particularly for attributes like weather, age, and object deformations.

The authors tackled the problem of achieving precise control over attributes in diffusion model image generation by introducing interpretable concept sliders, which enable targeted edits with lower interference compared to previous techniques.

We present a method to create interpretable concept sliders that enable precise control over attributes in image generations from diffusion models. Our approach identifies a low-rank parameter direction corresponding to one concept while minimizing interference with other attributes. A slider is created using a small set of prompts or sample images; thus slider directions can be created for either textual or visual concepts. Concept Sliders are plug-and-play: they can be composed efficiently and continuously modulated, enabling precise control over image generation. In quantitative experiments comparing to previous editing techniques, our sliders exhibit stronger targeted edits with lower interference. We showcase sliders for weather, age, styles, and expressions, as well as slider compositions. We show how sliders can transfer latents from StyleGAN for intuitive editing of visual concepts for which textual description is difficult. We also find that our method can help address persistent quality issues in Stable Diffusion XL including repair of object deformations and fixing distorted hands. Our code, data, and trained sliders are available at https://sliders.baulab.info/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes