NA LG COMP-PHDec 7, 2024

Finite Element Neural Network Interpolation. Part I: Interpretable and Adaptive Discretization for Solving PDEs

Kateřina Škardová, Alexandre Daby-Seesaram, Martin Genet

arXiv:2412.05719v15 citationsh-index: 4Has CodeComput Mech

Originality Incremental advance

AI Analysis

This work addresses computational challenges in PDE solving for engineering and scientific applications, offering incremental improvements to existing neural network-based methods.

The paper tackles solving PDEs by introducing the FENNI framework, a sparse neural network architecture that improves upon previous EFENN methods, resulting in enhanced efficiency and robustness in training, with demonstrated accuracy comparable to analytical solutions and classical FEM solvers on 1D and 2D test cases.

We present the Finite Element Neural Network Interpolation (FENNI) framework, a sparse neural network architecture extending previous work on Embedded Finite Element Neural Networks (EFENN) introduced with the Hierarchical Deep-learning Neural Networks (HiDeNN). Due to their mesh-based structure, EFENN requires significantly fewer trainable parameters than fully connected neural networks, with individual weights and biases having a clear interpretation. Our FENNI framework, within the EFENN framework, brings improvements to the HiDeNN approach. First, we propose a reference element-based architecture where shape functions are defined on a reference element, enabling variability in interpolation functions and straightforward use of Gaussian quadrature rules for evaluating the loss function. Second, we propose a pragmatic multigrid training strategy based on the framework's interpretability. Third, HiDeNN's combined rh-adaptivity is extended from 1D to 2D, with a new Jacobian-based criterion for adding nodes combining h- and r-adaptivity. From a deep learning perspective, adaptive mesh behavior through rh-adaptivity and the multigrid approach correspond to transfer learning, enabling FENNI to optimize the network's architecture dynamically during training. The framework's capabilities are demonstrated on 1D and 2D test cases, where its accuracy and computational cost are compared against an analytical solution and a classical FEM solver. On these cases, the multigrid training strategy drastically improves the training stage's efficiency and robustness. Finally, we introduce a variational loss within the EFENN framework, showing that it performs as well as energy-based losses and outperforms residual-based losses. This framework is extended to surrogate modeling over the parametric space in Part II.

View on arXiv PDF Code

Similar