LGAIMLApr 14, 2025

Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling

arXiv:2504.10612v522 citationsh-index: 69
Originality Highly original
AI Analysis

This work addresses a key problem in generative modeling for researchers and practitioners by providing a more flexible and efficient method that outperforms existing energy-based models, though it builds incrementally on prior approaches.

The paper tackles the limitation of flow-based generative models in integrating partial observations and priors by proposing Energy Matching, a framework that combines flow matching with energy-based models, achieving state-of-the-art performance on CIFAR-10 and ImageNet generation while enabling applications like protein generation.

Current state-of-the-art generative models map noise to data distributions by matching flows or scores. A key limitation of these models is their inability to readily integrate available partial observations and additional priors. In contrast, energy-based models (EBMs) address this by incorporating corresponding scalar energy terms. Here, we propose Energy Matching, a framework that endows flow-based approaches with the flexibility of EBMs. Far from the data manifold, samples move from noise to data along irrotational, optimal transport paths. As they approach the data manifold, an entropic energy term guides the system into a Boltzmann equilibrium distribution, explicitly capturing the underlying likelihood structure of the data. We parameterize these dynamics with a single time-independent scalar field, which serves as both a powerful generator and a flexible prior for effective regularization of inverse problems. The present method substantially outperforms existing EBMs on CIFAR-10 and ImageNet generation in terms of fidelity, while retaining simulation-free training of transport-based approaches away from the data manifold. Furthermore, we leverage the flexibility of the method to introduce an interaction energy that supports the exploration of diverse modes, which we demonstrate in a controlled protein generation setting. This approach learns a scalar potential energy, without time conditioning, auxiliary generators, or additional networks, marking a significant departure from recent EBM methods. We believe this simplified yet rigorous formulation significantly advances EBMs capabilities and paves the way for their wider adoption in generative modeling in diverse domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes