GRAICVMar 5, 2025

ProReflow: Progressive Reflow with Decomposed Velocity

arXiv:2503.04824v15 citationsh-index: 6CVPR
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in diffusion models for image generation, offering incremental improvements to flow matching methods.

The paper tackles the high computational cost of diffusion models by improving flow matching techniques, achieving an FID of 10.70 on MSCOCO2014 with only 4 sampling steps, close to the teacher model's 10.05 with 32 steps.

Diffusion models have achieved significant progress in both image and video generation while still suffering from huge computation costs. As an effective solution, flow matching aims to reflow the diffusion process of diffusion models into a straight line for a few-step and even one-step generation. However, in this paper, we suggest that the original training pipeline of flow matching is not optimal and introduce two techniques to improve it. Firstly, we introduce progressive reflow, which progressively reflows the diffusion models in local timesteps until the whole diffusion progresses, reducing the difficulty of flow matching. Second, we introduce aligned v-prediction, which highlights the importance of direction matching in flow matching over magnitude matching. Experimental results on SDv1.5 and SDXL demonstrate the effectiveness of our method, for example, conducting on SDv1.5 achieves an FID of 10.70 on MSCOCO2014 validation set with only 4 sampling steps, close to our teacher model (32 DDIM steps, FID = 10.05).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes