A recipe for scalable attention-based MLIPs: unlocking long-range accuracy with all-to-all node attention

arXiv:2603.06567v12 citations
Predicted impact top 17% in LG · last 90 daysOriginality Highly original
AI Analysis

This work addresses the challenge of accurately capturing long-range interactions in MLIPs for large systems like biomolecules and electrolytes, offering a data-driven solution for researchers in computational chemistry and materials science.

The paper introduces AllScAIP, an attention-based machine-learning interatomic potential (MLIP) model designed to capture long-range interactions effectively. It achieves state-of-the-art energy/force accuracy on molecular systems and competitive performance on materials and catalysts, while enabling stable molecular dynamics simulations that accurately recover experimental observables like density and heat of vaporization.

Machine-learning interatomic potentials (MLIPs) have advanced rapidly, with many top models relying on strong physics-based inductive biases. However, as models scale to larger systems like biomolecules and electrolytes, they struggle to accurately capture long-range (LR) interactions, leading current approaches to rely on explicit physics-based terms or components. In this work, we propose AllScAIP, a straightforward, attention-based, and energy-conserving MLIP model that scales to O(100 million) training samples. It addresses the long-range challenge using an all-to-all node attention component that is data-driven. Extensive ablations reveal that in low-data/small-model regimes, inductive biases improve sample efficiency. However, as data and model size scale, these benefits diminish or even reverse, while all-to-all attention remains critical for capturing LR interactions. Our model achieves state-of-the-art energy/force accuracy on molecular systems, as well as a number of physics-based evaluations (OMol25), while being competitive on materials (OMat24) and catalysts (OC20). Furthermore, it enables stable, long-timescale MD simulations that accurately recover experimental observables, including density and heat of vaporization predictions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes