CHEM-PH LG COMP-PHSep 8, 2021

Training Algorithm Matters for the Performance of Neural Network Potential: A Case Study of Adam and the Kalman Filter Optimizers

Yunqi Shao, Florian M. Dietrich, Carl Nettelblad, Chao Zhang

arXiv:2109.03769v31.213 citations

Originality Synthesis-oriented

AI Analysis

This addresses the choice of training algorithms for neural network potentials in computational chemistry, offering incremental insights for improved model robustness.

The study compared Adam and Extended Kalman Filter (EKF) optimizers for training neural network potentials (NNPs) on liquid water datasets, finding that EKF-trained NNPs are more transferable and less sensitive to learning rates, with performance correlating better with a Fisher information measure than validation errors.

One hidden yet important issue for developing neural network potentials (NNPs) is the choice of training algorithm. Here we compare the performance of two popular training algorithms, the adaptive moment estimation algorithm (Adam) and the Extended Kalman Filter algorithm (EKF), using the Behler-Parrinello neural network (BPNN) and two publicly accessible datasets of liquid water [Proc. Natl. Acad. Sci. U.S.A. 2016, 113, 8368-8373 and Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 1110-1115]. This is achieved by implementing EKF in TensorFlow. It is found that NNPs trained with EKF are more transferable and less sensitive to the value of the learning rate, as compared to Adam. In both cases, error metrics of the validation set do not always serve as a good indicator for the actual performance of NNPs. Instead, we show that their performance correlates well with a Fisher information based similarity measure.

View on arXiv PDF

Similar