Performance of Neural and Polynomial Operator Surrogates

Josephine Westermann, Benno Huber, Thomas O'Leary-Roseberry, Jakob Zech

arXiv:2604.0068953.04 citationsh-index: 2

AI Analysis

This work provides practical guidance for selecting surrogate methods in computational science and engineering, based on input regularity and computational constraints, but it is incremental as it compares existing techniques without introducing new paradigms.

The paper systematically compares neural and polynomial operator surrogates for parametric PDEs, finding that polynomial methods are more data-efficient for smooth inputs (e.g., convergence rates align with theory for s ≥ 2), while Fourier neural operators converge faster for rough inputs (s ≤ 1), with derivative-informed training improving data efficiency in low-data regimes.

We consider the problem of constructing surrogate operators for parameter-to-solution maps arising from parametric partial differential equations, where repeated forward model evaluations are computationally expensive. We present a systematic empirical comparison of neural operator surrogates, including a reduced-basis neural operator trained with $L^2_Î¼$ and $H^1_Î¼$ objectives and the Fourier neural operator, against polynomial surrogate methods, specifically a reduced-basis sparse-grid surrogate and a reduced-basis tensor-train surrogate. All methods are evaluated on a linear parametric diffusion problem and a nonlinear parametric hyperelasticity problem, using input fields with algebraically decaying spectral coefficients at varying rates of decay $s$. To enable fair comparisons, we analyze ensembles of surrogate models generated by varying hyperparameters and compare the resulting Pareto frontiers of cost versus approximation accuracy, decomposing cost into contributions from data generation, setup, and evaluation. Our results show that no single method is universally superior. Polynomial surrogates achieve substantially better data efficiency for smooth input fields ($s \geq 2$), with convergence rates for the sparse-grid surrogate in agreement with theoretical predictions. For rough inputs ($s \leq 1$), the Fourier neural operator displays the fastest convergence rates. Derivative-informed training consistently improves data efficiency over standard $L^2_Î¼$ training, providing a competitive alternative for rough inputs in the low-data regime when Jacobian information is available at reasonable cost. These findings highlight the importance of matching the surrogate methodology to the regularity of the problem as well as accuracy demands and computational constraints of the application.

View on arXiv PDF

Similar