LGCVFeb 13, 2025

Designing a Conditional Prior Distribution for Flow-Based Generative Models

arXiv:2502.09611v14 citationsh-index: 28Trans. Mach. Learn. Res.
Originality Highly original
AI Analysis

This work tackles the problem of inefficient conditional generation for users of flow-based generative models by reducing training times and improving sample quality with fewer steps.

This paper addresses the inefficiency of flow-based generative models in conditional generation by designing a conditional prior distribution. They map an input condition to an "average" data point and then use flow matching to map samples from a parametric distribution centered around this point to the conditional target distribution, significantly improving training times and generation efficiency.

Flow-based generative models have recently shown impressive performance for conditional generation tasks, such as text-to-image generation. However, current methods transform a general unimodal noise distribution to a specific mode of the target data distribution. As such, every point in the initial source distribution can be mapped to every point in the target distribution, resulting in long average paths. To this end, in this work, we tap into a non-utilized property of conditional flow-based models: the ability to design a non-trivial prior distribution. Given an input condition, such as a text prompt, we first map it to a point lying in data space, representing an ``average" data point with the minimal average distance to all data points of the same conditional mode (e.g., class). We then utilize the flow matching formulation to map samples from a parametric distribution centered around this point to the conditional target distribution. Experimentally, our method significantly improves training times and generation efficiency (FID, KID and CLIP alignment scores) compared to baselines, producing high quality samples using fewer sampling steps.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes