CVFeb 21

Rethinking Preference Alignment for Diffusion Models with Classifier-Free Guidance

arXiv:2602.18799v11 citations
Originality Incremental advance
AI Analysis

This addresses the problem of nuanced preference alignment for users of diffusion models, offering an incremental improvement over existing methods.

The paper tackled the challenge of aligning text-to-image diffusion models with human preferences by proposing a method that uses classifier-free guidance with a contrastive guidance vector, achieving consistent quantitative and qualitative gains on Stable Diffusion models.

Aligning large-scale text-to-image diffusion models with nuanced human preferences remains challenging. While direct preference optimization (DPO) is simple and effective, large-scale finetuning often shows a generalization gap. We take inspiration from test-time guidance and cast preference alignment as classifier-free guidance (CFG): a finetuned preference model acts as an external control signal during sampling. Building on this view, we propose a simple method that improves alignment without retraining the base model. To further enhance generalization, we decouple preference learning into two modules trained on positive and negative data, respectively, and form a \emph{contrastive guidance} vector at inference by subtracting their predictions (positive minus negative), scaled by a user-chosen strength and added to the base prediction at each step. This yields a sharper and controllable alignment signal. We evaluate on Stable Diffusion 1.5 and Stable Diffusion XL with Pick-a-Pic v2 and HPDv3, showing consistent quantitative and qualitative gains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes