CVLGNov 9, 2023

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

arXiv:2311.05556v1246 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This provides a universal acceleration solution for diverse image generation tasks, though it is incremental as it builds on existing LCM and LoRA techniques.

The authors tackled the problem of accelerating text-to-image generation by extending Latent Consistency Models (LCMs) to larger Stable-Diffusion models with reduced memory usage, achieving superior image quality and enabling a plug-in acceleration module called LCM-LoRA that works without additional training.

Latent Consistency Models (LCMs) have achieved impressive performance in accelerating text-to-image generative tasks, producing high-quality images with minimal inference steps. LCMs are distilled from pre-trained latent diffusion models (LDMs), requiring only ~32 A100 GPU training hours. This report further extends LCMs' potential in two aspects: First, by applying LoRA distillation to Stable-Diffusion models including SD-V1.5, SSD-1B, and SDXL, we have expanded LCM's scope to larger models with significantly less memory consumption, achieving superior image generation quality. Second, we identify the LoRA parameters obtained through LCM distillation as a universal Stable-Diffusion acceleration module, named LCM-LoRA. LCM-LoRA can be directly plugged into various Stable-Diffusion fine-tuned models or LoRAs without training, thus representing a universally applicable accelerator for diverse image generation tasks. Compared with previous numerical PF-ODE solvers such as DDIM, DPM-Solver, LCM-LoRA can be viewed as a plug-in neural PF-ODE solver that possesses strong generalization abilities. Project page: https://github.com/luosiallen/latent-consistency-model.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes