LGMay 27, 2023

Scalable Transformer for PDE Surrogate Modeling

arXiv:2305.17560v2155 citations
Originality Incremental advance
AI Analysis

This addresses the problem of scaling Transformer-based surrogate models for multi-dimensional PDEs, offering a computationally efficient solution for researchers and practitioners in computational physics and engineering.

The paper tackles the computational instability and expense of applying Transformers to large-grid PDE surrogate modeling by proposing Factorized Transformer (FactFormer), which uses an axial factorized kernel integral to simulate 2D Kolmogorov flow on a 256x256 grid and 3D smoke buoyancy on a 64x64x64 grid with good accuracy and efficiency.

Transformer has shown state-of-the-art performance on various applications and has recently emerged as a promising tool for surrogate modeling of partial differential equations (PDEs). Despite the introduction of linear-complexity attention, applying Transformer to problems with a large number of grid points can be numerically unstable and computationally expensive. In this work, we propose Factorized Transformer (FactFormer), which is based on an axial factorized kernel integral. Concretely, we introduce a learnable projection operator that decomposes the input function into multiple sub-functions with one-dimensional domain. These sub-functions are then evaluated and used to compute the instance-based kernel with an axial factorized scheme. We showcase that the proposed model is able to simulate 2D Kolmogorov flow on a $256\times 256$ grid and 3D smoke buoyancy on a $64\times64\times64$ grid with good accuracy and efficiency. The proposed factorized scheme can serve as a computationally efficient low-rank surrogate for the full attention scheme when dealing with multi-dimensional problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes