LGFeb 2, 2023Code
Convolutional Neural Operators for robust and accurate learning of PDEsBogdan Raonić, Roberto Molinaro, Tim De Ryck et al.
Although very successfully used in conventional machine learning, convolution based neural network architectures -- believed to be inconsistent in function space -- have been largely ignored in the context of learning solution operators of PDEs. Here, we present novel adaptations for convolutional neural networks to demonstrate that they are indeed able to process functions as inputs and outputs. The resulting architecture, termed as convolutional neural operators (CNOs), is designed specifically to preserve its underlying continuous nature, even when implemented in a discretized form on a computer. We prove a universality theorem to show that CNOs can approximate operators arising in PDEs to desired accuracy. CNOs are tested on a novel suite of benchmarks, encompassing a diverse set of PDEs with possibly multi-scale solutions and are observed to significantly outperform baselines, paving the way for an alternative framework for robust and accurate operator learning. Our code is publicly available at https://github.com/bogdanraonic3/ConvolutionalNeuralOperator
CAJan 28, 2016
Stability estimates for the regularized inversion of the truncated Hilbert transformRima Alaifari, Michel Defrise, Alexander Katsevich
In limited data computerized tomography, the 2D or 3D problem can be reduced to a family of 1D problems using the differentiated backprojection (DBP) method. Each 1D problem consists of recovering a compactly supported function $f \in L^2(\mathcal F)$, where $\mathcal F$ is a finite interval, from its partial Hilbert transform data. When the Hilbert transform is measured on a finite interval $\mathcal G$ that only overlaps but does not cover $\mathcal F$ this inversion problem is known to be severely ill-posed [1]. In this paper, we study the reconstruction of $f$ restricted to the overlap region $\mathcal F \cap \mathcal G$. We show that with this restriction and by assuming prior knowledge on the $L^2$ norm or on the variation of $f$, better stability with Hölder continuity (typical for mildly ill-posed problems) can be obtained.
IVJun 10, 2022
Localized adversarial artifacts for compressed sensing MRIRima Alaifari, Giovanni S. Alberti, Tandri Gauksson
As interest in deep neural networks (DNNs) for image reconstruction tasks grows, their reliability has been called into question (Antun et al., 2020; Gottschling et al., 2020). However, recent work has shown that, compared to total variation (TV) minimization, when appropriately regularized, DNNs show similar robustness to adversarial noise in terms of $\ell^2$-reconstruction error (Genzel et al., 2022). We consider a different notion of robustness, using the $\ell^\infty$-norm, and argue that localized reconstruction artifacts are a more relevant defect than the $\ell^2$-error. We create adversarial perturbations to undersampled magnetic resonance imaging measurements (in the frequency domain) which induce severe localized artifacts in the TV-regularized reconstruction. Notably, the same attack method is not as effective against DNN based reconstruction. Finally, we show that this phenomenon is inherent to reconstruction methods for which exact recovery can be guaranteed, as with compressed sensing reconstructions with $\ell^1$- or TV-minimization.
APApr 10
On $L^\infty$ stability for wave propagation and for linear inverse problemsRima Alaifari, Giovanni S. Alberti, Tandri Gauksson
Stability is a key property of both forward models and inverse problems, and depends on the norms considered in the relevant function spaces. For instance, stability estimates for hyperbolic partial differential equations are often based on energy conservation principles, and are therefore expressed in terms of $L^2$ norms. The focus of this paper is on stability with respect to the $L^\infty$ norm, which is more relevant to detect localized phenomena. The linear wave equation is not stable in $L^\infty$, and we design an alternative solution method based on the regularization of Fourier multipliers, which is stable in $L^\infty$. Furthermore, we show how these ideas can be extended to inverse problems, and design a regularization method for the inversion of compact operators that is stable in $L^\infty$. We also discuss the connection with the stability of deep neural networks modeled by hyperbolic PDEs.
NAFeb 27, 2025
Scalable Signature Kernel Computations for Long Time Series via Local Neumann Series ExpansionsMatthew Tamayo-Rios, Alexander Schell, Rima Alaifari
The signature kernel is a recent state-of-the-art tool for analyzing high-dimensional sequential data, valued for its theoretical guarantees and strong empirical performance. In this paper, we present a novel method for efficiently computing the signature kernel of long, high-dimensional time series via adaptively truncated recursive local power series expansions. Building on the characterization of the signature kernel as the solution of a Goursat PDE, our approach employs tilewise Neumann-series expansions to derive rapidly converging power series approximations of the signature kernel that are locally defined on subdomains and propagated iteratively across the entire domain of the Goursat solution by exploiting the geometry of the time series. Algorithmically, this involves solving a system of interdependent Goursat PDEs via adaptively truncated local power series expansions and recursive propagation of boundary conditions along a directed graph in a topological ordering. This method strikes an effective balance between computational cost and accuracy, achieving substantial performance improvements over state-of-the-art approaches for computing the signature kernel. It offers (a) adjustable and superior accuracy, even for time series with very high roughness; (b) drastically reduced memory requirements; and (c) scalability to efficiently handle very long time series (one million data points or more) on a single GPU. As demonstrated in our benchmarks, these advantages make our method particularly well-suited for rough-path-assisted machine learning, financial modeling, and signal processing applications involving very long and highly volatile sequential data.
LGMay 31, 2023
Representation Equivalent Neural Operators: a Framework for Alias-free Operator LearningFrancesca Bartolucci, Emmanuel de Bézenac, Bogdan Raonić et al.
Recently, operator learning, or learning mappings between infinite-dimensional function spaces, has garnered significant attention, notably in relation to learning partial differential equations from data. Conceptually clear when outlined on paper, neural operators necessitate discretization in the transition to computer implementations. This step can compromise their integrity, often causing them to deviate from the underlying operators. This research offers a fresh take on neural operators with a framework Representation equivalent Neural Operators (ReNO) designed to address these issues. At its core is the concept of operator aliasing, which measures inconsistency between neural operators and their discrete representations. We explore this for widely-used operator learning techniques. Our findings detail how aliasing introduces errors when handling different discretizations and grids and loss of crucial continuous structures. More generally, this framework not only sheds light on existing challenges but, given its constructive and broad nature, also potentially offers tools for developing new neural operators.
CVApr 20, 2018
ADef: an Iterative Algorithm to Construct Adversarial DeformationsRima Alaifari, Giovanni S. Alberti, Tandri Gauksson
While deep neural networks have proven to be a powerful tool for many recognition and classification tasks, their stability properties are still not well understood. In the past, image classifiers have been shown to be vulnerable to so-called adversarial attacks, which are created by additively perturbing the correctly classified image. In this paper, we propose the ADef algorithm to construct a different kind of adversarial attack created by iteratively applying small deformations to the image, found through a gradient descent step. We demonstrate our results on MNIST with convolutional neural networks and on ImageNet with Inception-v3 and ResNet-101.