LGFeb 3

Efficient Training of Boltzmann Generators Using Off-Policy Log-Dispersion Regularization

Henrik Schopmans, Christopher von Klitzing, Pascal Friederich

arXiv:2602.03729v12.71 citationsh-index: 5

Originality Incremental advance

AI Analysis

This addresses the problem of costly simulation data and energy evaluations in computational science, offering an incremental improvement in training efficiency.

The paper tackles the challenge of data-efficient training for Boltzmann generators, which sample from physical system distributions, by proposing off-policy log-dispersion regularization (LDR) to improve performance and sample efficiency, achieving gains of up to one order of magnitude.

Sampling from unnormalized probability densities is a central challenge in computational science. Boltzmann generators are generative models that enable independent sampling from the Boltzmann distribution of physical systems at a given temperature. However, their practical success depends on data-efficient training, as both simulation data and target energy evaluations are costly. To this end, we propose off-policy log-dispersion regularization (LDR), a novel regularization framework that builds on a generalization of the log-variance objective. We apply LDR in the off-policy setting in combination with standard data-based training objectives, without requiring additional on-policy samples. LDR acts as a shape regularizer of the energy landscape by leveraging additional information in the form of target energy labels. The proposed regularization framework is broadly applicable, supporting unbiased or biased simulation datasets as well as purely variational training without access to target samples. Across all benchmarks, LDR improves both final performance and data efficiency, with sample efficiency gains of up to one order of magnitude.

View on arXiv PDF

Similar