CHEM-PHLGCOMP-PHSep 8, 2021

Training Algorithm Matters for the Performance of Neural Network Potential: A Case Study of Adam and the Kalman Filter Optimizers

arXiv:2109.03769v313 citations
Originality Synthesis-oriented
AI Analysis

This addresses the choice of training algorithms for neural network potentials in computational chemistry, offering incremental insights for improved model robustness.

The study compared Adam and Extended Kalman Filter (EKF) optimizers for training neural network potentials (NNPs) on liquid water datasets, finding that EKF-trained NNPs are more transferable and less sensitive to learning rates, with performance correlating better with a Fisher information measure than validation errors.

One hidden yet important issue for developing neural network potentials (NNPs) is the choice of training algorithm. Here we compare the performance of two popular training algorithms, the adaptive moment estimation algorithm (Adam) and the Extended Kalman Filter algorithm (EKF), using the Behler-Parrinello neural network (BPNN) and two publicly accessible datasets of liquid water [Proc. Natl. Acad. Sci. U.S.A. 2016, 113, 8368-8373 and Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 1110-1115]. This is achieved by implementing EKF in TensorFlow. It is found that NNPs trained with EKF are more transferable and less sensitive to the value of the learning rate, as compared to Adam. In both cases, error metrics of the validation set do not always serve as a good indicator for the actual performance of NNPs. Instead, we show that their performance correlates well with a Fisher information based similarity measure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes