LG · May 13

Frequency Bias and OOD Generalization in Neural Operators under a Variable-Coefficient Wave Equation

arXiv:2605.12997

AI Analysis

For researchers developing neural operators for PDE simulations, this work reveals architectural representation biases that limit generalization under distribution shifts, underscoring the need for more robust designs.

The paper investigates out-of-distribution generalization of two neural operators, FNO and DeepONet, under shifts in input frequency and coefficient smoothness in a wave equation setting. FNO shows a sharp increase in error under high-frequency shifts, while DeepONet degrades more mildly, highlighting a gap between in-distribution performance and OOD generalization.

Neural operators learn to map initial conditions to the terminal solution of partial differential equations (PDEs), providing a surrogate for the full operator mapping. This enables rapid prediction across different input configurations. While recent neural operator architectures have demonstrated strong performance on diverse PDE tasks, their behavior under structured distribution shifts remains insufficiently understood. To investigate this, we study operator learning in a wave propagation setting governed by a one-dimensional variable-coefficient wave equation, using two representative architectures, the Fourier Neural Operator (FNO) and the Deep Operator Network (DeepONet). To examine their generalization under distribution shifts, we consider structured out-of-distribution (OOD) settings that independently vary input frequency and coefficient smoothness. The results show that under smoothness shifts, both models maintain stable performance, with FNO achieving lower error. In contrast, under frequency shifts, FNO exhibits a sharp increase in error on unseen high-frequency inputs, whereas DeepONet shows milder degradation despite higher overall error. Our analysis reveals that these differences arise from how each architecture represents and responds to variations in frequency structure. Together, these findings highlight a fundamental gap between strong in-distribution performance and generalization under distribution shifts in operator learning, underscoring the role of architectural representation bias in developing more reliable neural operators for physics-based PDE simulations beyond the training distribution.
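To make the two shift axes concrete, here is a minimal NumPy sketch of how frequency and smoothness splits could be constructed on a periodic 1D grid. It is an illustration under stated assumptions, not the paper's data pipeline: the function names, the mode bands (1 to 8 versus 9 to 16), the 1/k amplitude decay, the Gaussian low-pass filter, and the wave-speed range are all hypothetical choices.

```python
import numpy as np

def sample_initial_condition(x, k_min, k_max, rng):
    """Random sine series with integer modes in [k_min, k_max] on a periodic grid.

    Hypothetical generator: the band limits and the 1/k amplitude decay are
    illustrative assumptions, not values taken from the paper.
    """
    ks = np.arange(k_min, k_max + 1)
    amps = rng.normal(size=ks.size) / ks
    phases = rng.uniform(0.0, 2.0 * np.pi, size=ks.size)
    waves = np.sin(2.0 * np.pi * ks[:, None] * x[None, :] + phases[:, None])
    return (amps[:, None] * waves).sum(axis=0)

def sample_coefficient(x, length_scale, rng, c_min=0.5, c_max=1.5):
    """Positive coefficient field; a smaller length_scale gives a rougher field.

    Hypothetical generator: white noise is low-pass filtered in Fourier space
    and then rescaled to [c_min, c_max] so the wave speed stays positive.
    """
    n = x.size
    noise = rng.normal(size=n)
    k = np.fft.rfftfreq(n, d=1.0 / n)  # integer wavenumbers 0, 1, ..., n/2
    low_pass = np.exp(-0.5 * (2.0 * np.pi * k * length_scale) ** 2)
    field = np.fft.irfft(np.fft.rfft(noise) * low_pass, n=n)
    field = (field - field.min()) / (field.max() - field.min())
    return c_min + (c_max - c_min) * field

x = np.linspace(0.0, 1.0, 256, endpoint=False)
rng = np.random.default_rng(0)

# Frequency shift: train on low modes, evaluate on a disjoint higher band.
u0_train = sample_initial_condition(x, 1, 8, rng)
u0_ood = sample_initial_condition(x, 9, 16, rng)

# Smoothness shift: train on smooth coefficients, evaluate on rougher ones.
c_train = sample_coefficient(x, 0.10, rng)
c_ood = sample_coefficient(x, 0.02, rng)
```

Keeping the two generators separate lets each factor be shifted while the other stays at its training setting, which matches the independent variation described in the abstract.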
