Performance of universal machine-learned potentials with explicit long-range interactions in biomolecular simulations

arXiv:2508.10841v1h-index: 14J Chem Theory Comput
Originality Synthesis-oriented
AI Analysis

This work addresses challenges in applying universal machine-learned potentials to biomolecular simulations, highlighting issues with imbalanced datasets and evaluation practices, which is incremental for computational chemistry and biophysics.

The study evaluated universal machine-learned potentials with explicit long-range interactions in biomolecular simulations, finding that larger models improved benchmark accuracy but not simulation properties, and long-range electrostatics had no systematic impact except increased conformational variability in Trp-cage.

Universal machine-learned potentials promise transferable accuracy across compositional and vibrational degrees of freedom, yet their application to biomolecular simulations remains underexplored. This work systematically evaluates equivariant message-passing architectures trained on the SPICE-v2 dataset with and without explicit long-range dispersion and electrostatics. We assess the impact of model size, training data composition, and electrostatic treatment across in- and out-of-distribution benchmark datasets, as well as molecular simulations of bulk liquid water, aqueous NaCl solutions, and biomolecules, including alanine tripeptide, the mini-protein Trp-cage, and Crambin. While larger models improve accuracy on benchmark datasets, this trend does not consistently extend to properties obtained from simulations. Predicted properties also depend on the composition of the training dataset. Long-range electrostatics show no systematic impact across systems. However, for Trp-cage, their inclusion yields increased conformational variability. Our results suggest that imbalanced datasets and immature evaluation practices currently challenge the applicability of universal machine-learned potentials to biomolecular simulations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes