Andreas Rupp

NA
h-index38
12papers
15citations
Novelty41%
AI Score49

12 Papers

63.1NAMay 28
A numerical method for the fractional Zakharov-Kuznetsov equation

Mukul Dwivedi, Andreas Rupp

This paper develops a fully discrete Fourier spectral Galerkin (FSG) method for the fractional Zakharov--Kuznetsov (fZK) equation posed on a two-dimensional periodic domain. The equation generalizes the classical ZK model by replacing the Laplacian with a fractional Laplacian of order \(α\in(0,2]\), thereby covering the classical ZK equation \(α=2\), the higher-dimensional Benjamin--Ono--ZK equation \(α=1\), and weaker fractional-dispersion regimes \(0<α<1\). We first propose a semi-discrete FSG scheme in space that preserves the discrete analogues of mass, momentum, and Hamiltonian energy. Using periodic Kato--Ponce product and commutator estimates, we prove local-in-time uniform Sobolev bounds and strong convergence of the semi-discrete approximations to the unique strong solution in \(C([0,\bar T];L^2_{\mathrm{per}}(Ω))\), for the initial condition in \(H^s_{\mathrm{per}}(Ω)\), \(s\geq 2+α\), and, as by product, we show that the existence and uniqueness of fZK equation in \(L^\infty(0,\bar T;H^s_{\mathrm{per}}(Ω))\cap W^{1,\infty}(0,\bar T;L^2_{\mathrm{per}}(Ω))\). We then introduce a modified projection adapted to the fractional transport dispersive operator and prove optimal spatial error estimates of order \(\mathcal O(N^{-r})\) for \(r>2+α\), together with exponential convergence for analytic solutions. An integrating-factor fourth-order four-stage Runge--Kutta time discretization is used to integrate the stiff fractional dispersive part exactly, and a fourth-order temporal error estimate is obtained under a high-regularity nonlinear stability assumption. Numerical experiments illustrate the accuracy, fractional-order dependence, and fully discrete conservation drift of the method.

NAJun 11, 2018
Discontinuous Galerkin method for coupling hydrostatic free surface flows to saturated subsurface systems

Andreas Rupp, Vadym Aizinger, Balthasar Reuter et al.

We formulate a coupled surface/subsurface flow model that relies on hydrostatic equations with free surface in the free flow domain and on the Darcy model in the subsurface part. The model is discretized using the local discontinuous Galerkin method, and a statement of discrete energy stability is proved for the fully non-linear coupled system.

CVDec 24, 2025Code
PUFM++: Point Cloud Upsampling via Enhanced Flow Matching

Zhi-Song Liu, Chenhang He, Roland Maier et al.

Recent advances in generative modeling have demonstrated strong promise for high-quality point cloud upsampling. In this work, we present PUFM++, an enhanced flow-matching framework for reconstructing dense and accurate point clouds from sparse, noisy, and partial observations. PUFM++ improves flow matching along three key axes: (i) geometric fidelity, (ii) robustness to imperfect input, and (iii) consistency with downstream surface-based tasks. We introduce a two-stage flow-matching strategy that first learns a direct, straight-path flow from sparse inputs to dense targets, and then refines it using noise-perturbed samples to approximate the terminal marginal distribution better. To accelerate and stabilize inference, we propose a data-driven adaptive time scheduler that improves sampling efficiency based on interpolation behavior. We further impose on-manifold constraints during sampling to ensure that generated points remain aligned with the underlying surface. Finally, we incorporate a recurrent interface network~(RIN) to strengthen hierarchical feature interactions and boost reconstruction quality. Extensive experiments on synthetic benchmarks and real-world scans show that PUFM++ sets a new state of the art in point cloud upsampling, delivering superior visual fidelity and quantitative accuracy across a wide range of tasks. Code and pretrained models are publicly available at https://github.com/Holmes-Alan/Enhanced_PUFM.

IVAug 29, 2024
Downscaling Neural Network for Coastal Simulations

Zhi-Song Liu, Markus Büttner, Matthew Scarborough et al.

Learning the fine-scale details of a coastal ocean simulation from a coarse representation is a challenging task. For real-world applications, high-resolution simulations are necessary to advance understanding of many coastal processes, specifically, to predict flooding resulting from tsunamis and storm surges. We propose a Downscaling Neural Network for Coastal Simulation (DNNCS) for spatiotemporal enhancement to learn the high-resolution numerical solution. Given images of coastal simulations produced on low-resolution computational meshes using low polynomial order discontinuous Galerkin discretizations and a coarse temporal resolution, the proposed DNNCS learns to produce high-resolution free surface elevation and velocity visualizations in both time and space. To model the dynamic changes over time and space, we propose grid-aware spatiotemporal attention to project the temporal features to the spatial domain for non-local feature matching. The coordinate information is also utilized via positional encoding. For the final reconstruction, we use the spatiotemporal bilinear operation to interpolate the missing frames and then expand the feature maps to the frequency domain for residual mapping. Besides data-driven losses, the proposed physics-informed loss guarantees gradient consistency and momentum changes, leading to a 24% reduction in root-mean-square error compared to the model trained with only data-driven losses. To train the proposed model, we propose a coastal simulation dataset and use it for model optimization and evaluation. Our method shows superior downscaling quality and fast computation compared to the state-of-the-art methods.

85.5NAApr 5
Sufficient conditions for QMC analysis of finite elements for parametric differential equations

Vesa Kaarnioja, Andreas Rupp, Jay Gopalakrishnan

Parametric regularity of discretizations of flux vector fields satisfying a balance law is studied under some assumptions on a random parameter that links the flux with an unknown primal variable (often through a constitutive law). In the primary example of the stationary diffusion equation, the parameter corresponds to the inverse of the diffusivity. The random parameter is modeled here as a Gevrey-regular random field. Specific focus is on random fields expressible as functions of countably infinite sequences of independent random variables, which may be uniformly or normally distributed. Quasi-Monte Carlo (QMC) error bounds for some quantity of interest that depends on the flux are then derived using the parametric regularity. It is shown that the QMC method achieves a dimension-independent, faster-than-Monte Carlo convergence rate if the quantity of interest depends continuously on the primal variable, its flux, or its gradient. A series of assumptions are introduced with the goal of encompassing a broad class of discretizations by various finite element methods. The assumptions are verified for the diffusion equation discretized using conforming finite elements, mixed methods, and hybridizable discontinuous Galerkin schemes. Numerical experiments confirm the analytical findings, highlighting the role of accurate flux approximation in QMC methods.

37.8NAMar 27
A Galerkin Finite Element Method for the Fractional Calderón Problem

Mukul Dwivedi, Jesse Railo, Andreas Rupp

We study a numerical reconstruction strategy for the potential in the fractional Calderón problem from a single partial exterior measurement. The forward model is the fractional Schrödinger equation in a bounded domain, with prescribed exterior Dirichlet datum and corresponding measurement of the exterior flux in an open observation set. Motivated by single-measurement uniqueness results based on unique continuation \cite{ghosh2020uniqueness}, we propose a decomposition strategy and a Galerkin--Tikhonov method to recover the potential by a stabilized least-squares quotient in a dedicated coefficient space. We prove the existence and uniqueness of the discrete reconstructor and establish conditional convergence under natural consistency and parameter choice assumptions. We further derive {\it a priori} error estimates for the reconstructed state and for the coefficient reconstruction, and combine the latter with logarithmic stability for the continuous inverse problem to obtain a total coefficient error bound. The framework cleanly separates the forward solver from the inverse reconstruction step and is compatible with practical truncation and quadrature schemes for the integral fractional Laplacian. Numerical experiments in one and two space dimensions illustrate stability with respect to noise and demonstrate reconstructions of both smooth and discontinuous potentials.

50.2LGMar 12
Inverse Neural Operator for ODE Parameter Optimization

Zhi-Song Liu, Wenqing Peng, Helmi Toropainen et al.

We propose the Inverse Neural Operator (INO), a two-stage framework for recovering hidden ODE parameters from sparse, partial observations. In Stage 1, a Conditional Fourier Neural Operator (C-FNO) with cross-attention learns a differentiable surrogate that reconstructs full ODE trajectories from arbitrary sparse inputs, suppressing high-frequency artifacts via spectral regularization. In Stage 2, an Amortized Drifting Model (ADM) learns a kernel-weighted velocity field in parameter space, transporting random parameter initializations toward the ground truth without backpropagating through the surrogate, avoiding the Jacobian instabilities that afflict gradient-based inversion in stiff regimes. Experiments on a real-world stiff atmospheric chemistry benchmark (POLLU, 25 parameters) and a synthetic Gene Regulatory Network (GRN, 40 parameters) show that INO outperforms gradient-based and amortized baselines in parameter recovery accuracy while requiring only 0.23s inference time, a 487x speedup over iterative gradient descent.

34.9NAApr 6
A Convergent Hybridizable Discontinuous Galerkin Method for Einstein--Scalar Equations

Mukul Dwivedi, Andreas Rupp

We propose and analyze a hybridized discontinuous Galerkin (HDG) method for the spherically symmetric Einstein--scalar system in Bondi gauge. After rewriting the model as a local first-order PDE--ODE system by introducing suitable scaled variables, we construct a semidiscrete scheme in which the element unknowns are computed locally and the coupling is carried by traces on the mesh skeleton. In the present radial setting, these traces can be eliminated recursively, so that only the main evolution variable is advanced in time, while the metric variables are recovered from discrete constraint relations. We prove local semidiscrete well-posedness, derive a global \(L^2\)--stability estimate, establish an optimal order \(L^2\) error bound for the main evolution variable for polynomial degree \(k\ge 1\), and obtain reconstruction error estimates for the metric variables and the associated mass functional. Numerical experiments verify the predicted spatial convergence rate and illustrate qualitative features of the Einstein--scalar dynamics, including large-data collapse profiles and smooth-pulse evolution.

LGMar 4, 2025
Cellular Automaton With CNN

Valery Ashu, Zhisong Liu, Heikki Haario et al.

Cellular automata (CA) models are widely used to simulate complex systems with emergent behaviors, but identifying hidden parameters that govern their dynamics remains a significant challenge. This study explores the use of Convolutional Neural Networks (CNN) to identify jump parameters in a two-dimensional CA model. We propose a custom CNN architecture trained on CA-generated data to classify jump parameters, which dictates the neighborhood size and movement rules of cells within the CA. Experiments were conducted across varying domain sizes (25 x 25 to 150 x 150) and CA iterations (0 to 50), demonstrating that the accuracy improves with larger domain sizes, as they provide more spatial information for parameter estimation. Interestingly, while initial CA iterations enhance the performance, increasing the number of iterations beyond a certain threshold does not significantly improve accuracy, suggesting that only specific temporal information is relevant for parameter identification. The proposed CNN achieves competitive accuracy (89.31) compared to established architectures like LeNet-5 and AlexNet, while offering significantly faster inference times, making it suitable for real-time applications. This study highlights the potential of CNNs as a powerful tool for fast and accurate parameter estimation in CA models, paving the way for their use in more complex systems and higher-dimensional domains. Future work will explore the identification of multiple hidden parameters and extend the approach to three-dimensional CA models.

LGMar 7, 2025
phepy: Visual Benchmarks and Improvements for Out-of-Distribution Detectors

Juniper Tyree, Andreas Rupp, Petri S. Clusius et al.

Applying machine learning to increasingly high-dimensional problems with sparse or biased training data increases the risk that a model is used on inputs outside its training domain. For such out-of-distribution (OOD) inputs, the model can no longer make valid predictions, and its error is potentially unbounded. Testing OOD detection methods on real-world datasets is complicated by the ambiguity around which inputs are in-distribution (ID) or OOD. We design a benchmark for OOD detection, which includes three novel and easily-visualisable toy examples. These simple examples provide direct and intuitive insight into whether the detector is able to detect (1) linear and (2) non-linear concepts and (3) identify thin ID subspaces (needles) within high-dimensional spaces (haystacks). We use our benchmark to evaluate the performance of various methods from the literature. Since tactile examples of OOD inputs may benefit OOD detection, we also review several simple methods to synthesise OOD inputs for supervised training. We introduce two improvements, $t$-poking and OOD sample weighting, to make supervised detectors more precise at the ID-OOD boundary. This is especially important when conflicts between real ID and synthetic OOD sample blur the decision boundary. Finally, we provide recommendations for constructing and applying out-of-distribution detectors in machine learning.

CVFeb 17, 2025
Data-Efficient Limited-Angle CT Using Deep Priors and Regularization

Ilmari Vahteristo, Zhi-Song Liu, Andreas Rupp

Reconstructing an image from its Radon transform is a fundamental computed tomography (CT) task arising in applications such as X-ray scans. In many practical scenarios, a full 180-degree scan is not feasible, or there is a desire to reduce radiation exposure. In these limited-angle settings, the problem becomes ill-posed, and methods designed for full-view data often leave significant artifacts. We propose a very low-data approach to reconstruct the original image from its Radon transform under severe angle limitations. Because the inverse problem is ill-posed, we combine multiple regularization methods, including Total Variation, a sinogram filter, Deep Image Prior, and a patch-level autoencoder. We use a differentiable implementation of the Radon transform, which allows us to use gradient-based techniques to solve the inverse problem. Our method is evaluated on a dataset from the Helsinki Tomography Challenge 2022, where the goal is to reconstruct a binary disk from its limited-angle sinogram. We only use a total of 12 data points--eight for learning a prior and four for hyperparameter selection--and achieve results comparable to the best synthetic data-driven approaches.

NANov 12, 2024
Numerical Homogenization by Continuous Super-Resolution

Zhi-Song Liu, Roland Maier, Andreas Rupp

Finite element methods typically require a high resolution to satisfactorily approximate micro and even macro patterns of an underlying physical model. This issue can be circumvented by appropriate numerical homogenization or multiscale strategies that are able to obtain reasonable approximations on under-resolved scales. In this paper, we study the implicit neural representation and propose a continuous super-resolution network as a numerical homogenization strategy. It can take coarse finite element data to learn both in-distribution and out-of-distribution high-resolution finite element predictions. Our highlight is the design of a local implicit transformer, which is able to learn multiscale features. We also propose Gabor wavelet-based coordinate encodings which can overcome the bias of neural networks learning low-frequency features. Finally, perception is often preferred over distortion so scientists can recognize the visual pattern for further investigation. However, implicit neural representation is known for its lack of local pattern supervision. We propose to use stochastic cosine similarities to compare the local feature differences between prediction and ground truth. It shows better performance on structural alignments. Our experiments show that our proposed strategy achieves superior performance as an in-distribution and out-of-distribution super-resolution strategy.