LGOct 19, 2023

Improved Operator Learning by Orthogonal Attention

arXiv:2310.12487v455 citationsh-index: 18
Originality Incremental advance
AI Analysis

This addresses overfitting issues in scientific machine learning for PDE modeling, though it appears incremental as an enhancement to existing attention-based neural operators.

The paper tackles overfitting in attention-based neural operators for PDE solutions by developing orthogonal attention based on eigendecomposition and neural approximation of eigenfunctions, achieving performance improvements on six benchmark datasets.

Neural operators, as an efficient surrogate model for learning the solutions of PDEs, have received extensive attention in the field of scientific machine learning. Among them, attention-based neural operators have become one of the mainstreams in related research. However, existing approaches overfit the limited training data due to the considerable number of parameters in the attention mechanism. To address this, we develop an orthogonal attention based on the eigendecomposition of the kernel integral operator and the neural approximation of eigenfunctions. The orthogonalization naturally poses a proper regularization effect on the resulting neural operator, which aids in resisting overfitting and boosting generalization. Experiments on six standard neural operator benchmark datasets comprising both regular and irregular geometries show that our method can outperform competing baselines with decent margins.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes