LGAICVFeb 4

Temporal Pair Consistency for Variance-Reduced Flow Matching

arXiv:2602.04908v1
Originality Highly original
AI Analysis

This work addresses inefficiencies in generative modeling for applications like image synthesis, though it is incremental as it builds on existing flow-matching frameworks.

The paper tackles the problem of high estimator variance and inefficient sampling in continuous-time generative models like flow matching by introducing Temporal Pair Consistency (TPC), a lightweight variance-reduction principle that couples velocity predictions at paired timesteps, resulting in improved sample quality and efficiency with lower FID scores on datasets such as CIFAR-10 and ImageNet at identical or lower computational cost.

Continuous-time generative models, such as diffusion models, flow matching, and rectified flow, learn time-dependent vector fields but are typically trained with objectives that treat timesteps independently, leading to high estimator variance and inefficient sampling. Prior approaches mitigate this via explicit smoothness penalties, trajectory regularization, or modified probability paths and solvers. We introduce Temporal Pair Consistency (TPC), a lightweight variance-reduction principle that couples velocity predictions at paired timesteps along the same probability path, operating entirely at the estimator level without modifying the model architecture, probability path, or solver. We provide a theoretical analysis showing that TPC induces a quadratic, trajectory-coupled regularization that provably reduces gradient variance while preserving the underlying flow-matching objective. Instantiated within flow matching, TPC improves sample quality and efficiency across CIFAR-10 and ImageNet at multiple resolutions, achieving lower FID at identical or lower computational cost than prior methods, and extends seamlessly to modern SOTA-style pipelines with noise-augmented training, score-based denoising, and rectified flow.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes