CHEM-PHLGCOMP-PHAug 17, 2023

Accurate machine learning force fields via experimental and simulation data fusion

arXiv:2308.09142v138 citationsh-index: 19
Originality Incremental advance
AI Analysis

This incremental approach addresses the challenge of scarce and erroneous data for materials scientists, enabling more accurate molecular models for any material.

The researchers tackled the problem of inaccurate machine learning force fields by fusing Density Functional Theory calculations with experimental data to train a titanium potential, achieving higher accuracy across all target properties compared to single-source models.

Machine Learning (ML)-based force fields are attracting ever-increasing interest due to their capacity to span spatiotemporal scales of classical interatomic potentials at quantum-level accuracy. They can be trained based on high-fidelity simulations or experiments, the former being the common case. However, both approaches are impaired by scarce and erroneous data resulting in models that either do not agree with well-known experimental observations or are under-constrained and only reproduce some properties. Here we leverage both Density Functional Theory (DFT) calculations and experimentally measured mechanical properties and lattice parameters to train an ML potential of titanium. We demonstrate that the fused data learning strategy can concurrently satisfy all target objectives, thus resulting in a molecular model of higher accuracy compared to the models trained with a single data source. The inaccuracies of DFT functionals at target experimental properties were corrected, while the investigated off-target properties remained largely unperturbed. Our approach is applicable to any material and can serve as a general strategy to obtain highly accurate ML potentials.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes