CHEM-PHLGFeb 23

MACE-POLAR-1: A Polarisable Electrostatic Foundation Model for Molecular Chemistry

arXiv:2602.19411v19 citationsh-index: 22
Originality Highly original
AI Analysis

This work addresses the need for accurate and efficient electrostatic modeling in computational chemistry and drug discovery, representing a novel method for a known bottleneck rather than an incremental advancement.

The paper tackled the problem of accurately modeling electrostatic interactions and charge transfer in computational chemistry by developing a new electrostatic foundation model that extends the MACE architecture with explicit long-range interactions and electrostatic induction. The result is a model achieving chemical accuracy competitive with hybrid DFT across diverse benchmarks, including sub-kcal/mol prediction of molecular crystal formation energy and a fourfold improvement over short-ranged models on protein-ligand interactions.

Accurate modelling of electrostatic interactions and charge transfer is fundamental to computational chemistry, yet most machine learning interatomic potentials (MLIPs) rely on local atomic descriptors that cannot capture long-range electrostatic effects. We present a new electrostatic foundation model for molecular chemistry that extends the MACE architecture with explicit treatment of long-range interactions and electrostatic induction. Our approach combines local many-body geometric features with a non-self-consistent field formalism that updates learnable charge and spin densities through polarisable iterations to model induction, followed by global charge equilibration via learnable Fukui functions to control total charge and total spin. This design enables an accurate and physical description of systems with varying charge and spin states while maintaining computational efficiency. Trained on the OMol25 dataset of 100 million hybrid DFT calculations, our models achieve chemical accuracy across diverse benchmarks, with accuracy competitive with hybrid DFT on thermochemistry, reaction barriers, conformational energies, and transition metal complexes. Notably, we demonstrate that the inclusion of long-range electrostatics leads to a large improvement in the description of non-covalent interactions and supramolecular complexes over non-electrostatic models, including sub-kcal/mol prediction of molecular crystal formation energy in the X23-DMC dataset and a fourfold improvement over short-ranged models on protein-ligand interactions. The model's ability to handle variable charge and spin states, respond to external fields, provide interpretable spin-resolved charge densities, and maintain accuracy from small molecules to protein-ligand complexes positions it as a versatile tool for computational molecular chemistry and drug discovery.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes