LG OCJul 3, 2024

Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks

Xianliang Xu, Ting Du, Wang Kong, Bin Shan, Ye Li, Zhongyi Huang

arXiv:2407.02827v32.62 citationsh-index: 2

Originality Incremental advance

AI Analysis

This provides a theoretical foundation for using IGD in PINNs, potentially improving optimization for multi-scale physics problems, though it is incremental as it builds on existing PINN and optimization methods.

The paper tackles the problem of training physics-informed neural networks (PINNs) by analyzing the convergence of implicit gradient descent (IGD), showing that it converges linearly to a globally optimal solution for over-parameterized two-layer PINNs, with empirical validation.

The optimization algorithms are crucial in training physics-informed neural networks (PINNs), as unsuitable methods may lead to poor solutions. Compared to the common gradient descent (GD) algorithm, implicit gradient descent (IGD) outperforms it in handling certain multi-scale problems. In this paper, we provide convergence analysis for the IGD in training over-parameterized two-layer PINNs. We first derive the training dynamics of IGD in training two-layer PINNs. Then, over-parameterization allows us to prove that the randomly initialized IGD converges to a globally optimal solution at a linear convergence rate. Moreover, due to the distinct training dynamics of IGD compared to GD, the learning rate can be selected independently of the sample size and the least eigenvalue of the Gram matrix. Additionally, the novel approach used in our convergence analysis imposes a milder requirement on the network width. Finally, empirical results validate our theoretical findings.

View on arXiv PDF

Similar