LGJan 29

Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learning Force Fields

Yunyang Li, Lin Huang, Luojia Xia, Wenhe Zhang, Mark Gerstein

arXiv:2601.21985v11.4h-index: 20

Originality Incremental advance

AI Analysis

This addresses computational bottlenecks in physics-guided generative models for molecular conformations, though it appears incremental as it builds on existing diffusion models and foundational ML force fields.

The paper tackles the problem of E(3)-equivariant diffusion models for 3D molecular conformations reproducing biases from training data rather than capturing equilibrium distributions, by introducing Elign, a post-training framework that amortizes computational costs. The result shows Elign generates conformations with lower DFT energies and forces while maintaining fast inference, with no energy evaluations required during generation.

Generative models for 3D molecular conformations must respect Euclidean symmetries and concentrate probability mass on thermodynamically favorable, mechanically stable structures. However, E(3)-equivariant diffusion models often reproduce biases from semi-empirical training data rather than capturing the equilibrium distribution of a high-fidelity Hamiltonian. While physics-based guidance can correct this, it faces two computational bottlenecks: expensive quantum-chemical evaluations (e.g., DFT) and the need to repeat such queries at every sampling step. We present Elign, a post-training framework that amortizes both costs. First, we replace expensive DFT evaluations with a faster, pretrained foundational machine-learning force field (MLFF) to provide physical signals. Second, we eliminate repeated run-time queries by shifting physical steering to the training phase. To achieve the second amortization, we formulate reverse diffusion as a reinforcement learning problem and introduce Force--Energy Disentangled Group Relative Policy Optimization (FED-GRPO) to fine-tune the denoising policy. FED-GRPO includes a potential-based energy reward and a force-based stability reward, which are optimized and group-normalized independently. Experiments show that Elign generates conformations with lower gold-standard DFT energies and forces, while improving stability. Crucially, inference remains as fast as unguided sampling, since no energy evaluations are required during generation.

View on arXiv PDF

Similar