CEMay 19

Uncertainty-aware Machine Learning Interatomic Potentials via Learned Functional Perturbations

arXiv:2605.1993973.8
AI Analysis

For researchers using MLIPs in molecular simulations, this provides a simpler and more effective uncertainty quantification method that improves predictive accuracy and reliability.

The authors propose a method to turn deterministic machine learning interatomic potentials into probabilistic ones via learned functional perturbations, finetuned with the Continuous Ranked Probability Score. On the N-body charged particle benchmark, their method improves CRPS by 19-32% over the state-of-the-art Bayesian method BLIP; on silica, it raises Spearman correlation from 0.75 to 0.84.

Machine Learning Interatomic Potentials (MLIPs) achieve near ab initio accuracy at a fraction of the cost of quantum-mechanical simulations, yet they remain prone to silent failures on out-of-distribution configurations, making principled uncertainty quantification (UQ) essential for error-aware simulations and active learning. Existing non-ensemble UQ methods for MLIPs rely either on variational inference or on parametric distributional assumptions, both of which add architectural complexity and hyper-parameters that must be tuned per task. Inspired by recent advances in probabilistic weather forecasting, we propose a simpler alternative: turn a deterministic MLIP into a probabilistic one through learned functional perturbations and finetune it end-to-end with the Continuous Ranked Probability Score (CRPS), a proper scoring rule. We validate the approach with an equivariant GNN (P-EGNN) trained from scratch and by finetuning the foundation model the Orb-v3 for silica. On the N-body charged particle benchmark, P-EGNN improves CRPS over the state-of-the-art Bayesian MLIP method BLIP by 19-32% across all training sizes; on silica, P-Orb raises the Spearman correlation between predicted uncertainty and actual error from 0.75 (BLIP-Orb) to 0.84.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes