LG NA OCApr 17, 2024

End-to-End Mesh Optimization of a Hybrid Deep Learning Black-Box PDE Solver

Shaocong Ma, James Diffenderfer, Bhavya Kailkhura, Yi Zhou

arXiv:2404.11766v24.61 citationsh-index: 42

Originality Incremental advance

AI Analysis

This work addresses the problem of integrating non-differentiable PDE solvers into deep learning pipelines for computational fluid dynamics, offering an incremental improvement for researchers in this domain.

The study tackled the challenge of end-to-end training a hybrid model combining a black-box PDE solver with a deep learning model for fluid flow prediction, using a zeroth-order gradient estimator to enable differentiation; it resulted in outperforming a baseline with a frozen mesh and showed accelerated convergence and improved generalization with warm-starting.

Deep learning has been widely applied to solve partial differential equations (PDEs) in computational fluid dynamics. Recent research proposed a PDE correction framework that leverages deep learning to correct the solution obtained by a PDE solver on a coarse mesh. However, end-to-end training of such a PDE correction model over both solver-dependent parameters such as mesh parameters and neural network parameters requires the PDE solver to support automatic differentiation through the iterative numerical process. Such a feature is not readily available in many existing solvers. In this study, we explore the feasibility of end-to-end training of a hybrid model with a black-box PDE solver and a deep learning model for fluid flow prediction. Specifically, we investigate a hybrid model that integrates a black-box PDE solver into a differentiable deep graph neural network. To train this model, we use a zeroth-order gradient estimator to differentiate the PDE solver via forward propagation. Although experiments show that the proposed approach based on zeroth-order gradient estimation underperforms the baseline that computes exact derivatives using automatic differentiation, our proposed method outperforms the baseline trained with a frozen input mesh to the solver. Moreover, with a simple warm-start on the neural network parameters, we show that models trained by these zeroth-order algorithms achieve an accelerated convergence and improved generalization performance.

View on arXiv PDF

Similar