Learning Neural PDE Solvers with Parameter-Guided Channel Attention

arXiv:2304.14118v242 citationsh-index: 35
Originality Incremental advance
AI Analysis

This work addresses a key limitation in scientific machine learning for domains like weather forecasting and molecular dynamics, offering an incremental improvement in adapting neural surrogates to varying PDE parameters.

The paper tackles the problem of neural PDE solvers failing to generalize to unseen PDE parameters by proposing a parameter-guided channel attention module (CAPE) and a curriculum learning strategy, achieving consistent and significant improvements over baseline models on a popular PDE benchmark.

Scientific Machine Learning (SciML) is concerned with the development of learned emulators of physical systems governed by partial differential equations (PDE). In application domains such as weather forecasting, molecular dynamics, and inverse design, ML-based surrogate models are increasingly used to augment or replace inefficient and often non-differentiable numerical simulation algorithms. While a number of ML-based methods for approximating the solutions of PDEs have been proposed in recent years, they typically do not adapt to the parameters of the PDEs, making it difficult to generalize to PDE parameters not seen during training. We propose a Channel Attention mechanism guided by PDE Parameter Embeddings (CAPE) component for neural surrogate models and a simple yet effective curriculum learning strategy. The CAPE module can be combined with neural PDE solvers allowing them to adapt to unseen PDE parameters. The curriculum learning strategy provides a seamless transition between teacher-forcing and fully auto-regressive training. We compare CAPE in conjunction with the curriculum learning strategy using a popular PDE benchmark and obtain consistent and significant improvements over the baseline models. The experiments also show several advantages of CAPE, such as its increased ability to generalize to unseen PDE parameters without large increases inference time and parameter count.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes