CVAILGApr 28, 2025

Integration Flow Models

arXiv:2504.20179v13 citationsh-index: 3
Originality Highly original
AI Analysis

This addresses a key bottleneck for researchers and practitioners using ODE-based generative models, offering a novel method to enhance stability and accuracy, though it appears incremental as it builds on existing models like diffusion models and Rectified Flows.

The paper tackles the problem of discretization errors and training instability in ODE-based generative models by proposing Integration Flow, which directly learns the integral of ODE trajectories and incorporates the target state as an anchor, achieving improved performance with FIDs as low as 2.86 on CIFAR10 and 4.09 on ImageNet.

Ordinary differential equation (ODE) based generative models have emerged as a powerful approach for producing high-quality samples in many applications. However, the ODE-based methods either suffer the discretization error of numerical solvers of ODE, which restricts the quality of samples when only a few NFEs are used, or struggle with training instability. In this paper, we proposed Integration Flow, which directly learns the integral of ODE-based trajectory paths without solving the ODE functions. Moreover, Integration Flow explicitly incorporates the target state $\mathbf{x}_0$ as the anchor state in guiding the reverse-time dynamics. We have theoretically proven this can contribute to both stability and accuracy. To the best of our knowledge, Integration Flow is the first model with a unified structure to estimate ODE-based generative models and the first to show the exact straightness of 1-Rectified Flow without reflow. Through theoretical analysis and empirical evaluations, we show that Integration Flows achieve improved performance when it is applied to existing ODE-based models, such as diffusion models, Rectified Flows, and PFGM++. Specifically, Integration Flow achieves one-step generation on CIFAR10 with FIDs of 2.86 for the Variance Exploding (VE) diffusion model, 3.36 for rectified flow without reflow, and 2.91 for PFGM++; and on ImageNet with FIDs of 4.09 for VE diffusion model, 4.35 for rectified flow without reflow and 4.15 for PFGM++.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes