NA LG CO MLMar 13, 2024

Derivative-informed neural operator acceleration of geometric MCMC for infinite-dimensional Bayesian inverse problems

Lianghao Cao, Thomas O'Leary-Roseberry, Omar Ghattas

arXiv:2403.08220v29.714 citationsh-index: 9Has Code

Originality Incremental advance

AI Analysis

This work addresses the high computational cost of solving parametric PDEs in Bayesian inverse problems, offering a significant speedup for researchers in computational science and engineering, though it is an incremental improvement on existing operator learning methods.

The authors tackled the computational bottleneck of geometric Markov chain Monte Carlo (MCMC) for infinite-dimensional Bayesian inverse problems by developing a derivative-informed neural operator (DINO) surrogate to accelerate proposals, achieving 3–9 times faster posterior sampling than geometric MCMC and 60–97 times faster than prior methods.

We propose an operator learning approach to accelerate geometric Markov chain Monte Carlo (MCMC) for solving infinite-dimensional Bayesian inverse problems (BIPs). While geometric MCMC employs high-quality proposals that adapt to posterior local geometry, it requires repeated computations of gradients and Hessians of the log-likelihood, which becomes prohibitive when the parameter-to-observable (PtO) map is defined through expensive-to-solve parametric partial differential equations (PDEs). We consider a delayed-acceptance geometric MCMC method driven by a neural operator surrogate of the PtO map, where the proposal exploits fast surrogate predictions of the log-likelihood and, simultaneously, its gradient and Hessian. To achieve a substantial speedup, the surrogate must accurately approximate the PtO map and its Jacobian, which often demands a prohibitively large number of PtO map samples via conventional operator learning methods. In this work, we present an extension of derivative-informed operator learning [O'Leary-Roseberry et al., J. Comput. Phys., 496 (2024)] that uses joint samples of the PtO map and its Jacobian. This leads to derivative-informed neural operator (DINO) surrogates that accurately predict the observables and posterior local geometry at a significantly lower training cost than conventional methods. Cost and error analysis for reduced basis DINO surrogates are provided. Numerical studies demonstrate that DINO-driven MCMC generates effective posterior samples 3--9 times faster than geometric MCMC and 60--97 times faster than prior geometry-based MCMC. Furthermore, the training cost of DINO surrogates breaks even compared to geometric MCMC after just 10--25 effective posterior samples.

View on arXiv PDF Code

Similar