45.2LGJun 3Code
MeshTok: Efficient Multi-Scale Tokenization for Scalable PDE TransformersYanshun Zhao, Xiaoyu Peng, Jiamin Jiang et al.
Conventional patchified Transformers operate on uniform spatial partitions, distributing computational effort evenly across the domain irrespective of local features. This inflexible tokenization scheme is inherently limited in its ability to efficiently represent and process solutions to complex PDEs. To address this, we propose MeshTok, an adaptive mesh refinement (AMR)-inspired tokenization and sequence modeling framework. This method selectively refines spatial regions exhibiting sharp gradients, transient features, or multiscale structures, generating a heterogeneous set of multiscale tokens defined on a fixed simulation grid. These tokens are processed within a unified Transformer sequence, enabling the model to simultaneously capture coarse-grained global context and fine-grained local details without requiring specialized architectural components. Although adaptive refinement moderately increases token count, it promotes a more targeted allocation of computational resources to physically informative regions, which we view as a practical inductive bias rather than a formal optimality guarantee. Experimental evaluations across multiple PDE families and benchmark datasets demonstrate that MeshTok consistently improves the efficiency-accuracy trade-off compared to uniform-grid baselines. This suggests adaptive multiscale tokenization as a scalable and generalizable design principle for neural PDE modeling. Code is available at https://github.com/SCAILab-USTC/MeshTok.
46.0LGJun 2
Let There Be Light: Reflection, Refraction and Scattering for Neural OperatorsKeke Wu, Yixuan Zhang, Jingrun Chen
Neural operators learn mappings between infinite-dimensional function spaces and provide a data-driven surrogate modeling paradigm for parametric partial differential equations (PDEs). Existing architectures typically obtain expressivity by parameterizing integral kernels in prescribed transform domains or by applying attention-like interactions over discretized spatial points. While these approaches have achieved substantial progress, they often face a persistent trade-off among physical interpretability, nonlocal spatial communication, mesh scalability, and computational cost. We propose a Light-inspired neural operator(LiNO), an operator-learning architecture whose latent evolution is decomposed into three mechanisms motivated by elementary light transport: reflection, refraction, and scattering. Reflection and refraction act as adaptive pointwise transformations in latent feature space, enabling local feature reorientation and anisotropic modulation, whereas scattering performs input-dependent nonlocal propagation over the physical domain. We first formulate scattering as a normalized pairwise kernel with relative positional bias, and then develop an efficient scattering variant that replaces explicit pairwise interactions with positive-feature global propagation and a local diffusion branch, reducing the dominant spatial complexity from quadratic to linear. This yields a structured neural operator that separates local feature modulation from global spatial communication while retaining a modular and interpretable latent evolution.
LGJun 3, 2023
Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential EquationsYu Cao, Jingrun Chen, Yixin Luo et al.
The diffusion model has shown remarkable success in computer vision, but it remains unclear whether the ODE-based probability flow or the SDE-based diffusion model is more superior and under what circumstances. Comparing the two is challenging due to dependencies on data distributions, score training, and other numerical issues. In this paper, we study the problem mathematically for two limiting scenarios: the zero diffusion (ODE) case and the large diffusion case. We first introduce a pulse-shape error to perturb the score function and analyze error accumulation of sampling quality, followed by a thorough analysis for generalization to arbitrary error. Our findings indicate that when the perturbation occurs at the end of the generative process, the ODE model outperforms the SDE model with a large diffusion coefficient. However, when the perturbation occurs earlier, the SDE model outperforms the ODE model, and we demonstrate that the error of sample generation due to such a pulse-shape perturbation is exponentially suppressed as the diffusion term's magnitude increases to infinity. Numerical validation of this phenomenon is provided using Gaussian, Gaussian mixture, and Swiss roll distribution, as well as realistic datasets like MNIST and CIFAR-10.
NADec 14, 2017
Estimation of exciton diffusion lengths of organic semiconductors in random domainsJingrun Chen, Ling Lin, Zhiwen Zhang et al.
Exciton diffusion length plays a vital role in the function of opto-electronic devices. Oftentimes, the domain occupied by an organic semiconductor is subject to surface measurement error. In many experiments, photoluminescence over the domain is measured and used as the observation data to estimate this length parameter in an inverse manner based on the least square method. However, the result is sometimes found to be sensitive to the surface geometry of the domain. In this paper, we employ a random function representation for the uncertain surface of the domain. After non-dimensionalization, the forward model becomes a diffusion-type equation over the domain whose geometric boundary is subject to small random perturbations. We propose an asymptotic-based method as an approximate forward solver whose accuracy is justified both theoretically and numerically. It only requires solving several deterministic problems over a fixed domain. Therefore, for the same accuracy requirements we tested here, the running time of our approach is more than one order of magnitude smaller than that of directly solving the original stochastic boundary-value problem by the stochastic collocation method. In addition, from numerical results, we find that the correlation length of randomness is important to determine whether a 1D reduced model is a good surrogate for the 2D model.
NAMay 28, 2012
Ghost Force Influence of a Quasicontinuum Method in Two DimensionJingrun Chen, Pingbing Ming
We derive an analytical expression for the solution of a two-dimensional quasicontinuum method with a planar interface. The expression is used to prove that the ghost force may lead to a finite size error for the gradient of the solution. We estimate the width of the interfacial layer induced by the ghost force is of $\co(\sqrt{\eps}\,)$ with $\eps$ the equilibrium bond length, which is much wider than that of the one-dimensional problem.
42.2NAMay 8
Numerical Homogenization of Landau-Lifshitz Equation with Rough CoefficientsZetao Ma, Jingrun Chen, Rui Du et al.
In this work, we develop a numerical homogenization approach for the fully nonlinear Landau-Lifshitz equation with rough coefficients, including non-periodicity and nonseparable scales. Direct numerical resolution of such multiscale problems on fine meshes incurs prohibitive computational costs. To address this challenge, we propose an efficient coarse scale approximation through localized basis functions derived from energy minimization within the Generalized Rough Polyharmonic Splines (GRPS) framework. These basis functions preserve critical multiscale features while operating on a computationally tractable coarse mesh. The nonlinear, vectorial, and non-symmetric nature of the Landau-Lifshitz equation necessitates careful design of variational formulations for basis construction. We introduce several such formulations, each tailored to specific structural aspects of the problem. Through systematic numerical experiments, we demonstrate that our approach achieves significant computational savings without compromising accuracy, offering a robust framework for simulating multiscale magnetic systems with complex microstructures.
54.1NAMay 15
Finite volume element method for Landau-Lifshitz equationYunjie Gong, Jingrun Chen, Rui Du et al.
The Landau-Lifshitz equation describes the dynamics of magnetization in ferromagnetic materials. Due to the essential nonlinearity and nonconvex constraint, it is typically solved numerically. In this paper, we developed a finite volume element method (FVEM) with the Gauss-Seidel projection method (GSPM) for the micromagnetics simulations. We provide the approximation error in space and depict the energy law when the FVEM is adopted. Owing to the GSPM for time-marching, the discrete system is decoupled component by component, making the computational complexity comparable to that of solving the scalar heat equation implicitly. This significantly accelerates real simulations. We present several numerical experiments to validate the theoretical analysis and the efficiency gain. Additionally, we study the blow-up solution and efficiently simulate the 2D magnetic textures using the proposed method.
CVMar 29, 2025Code
STSA: Spatial-Temporal Semantic Alignment for Visual DubbingZijun Ding, Mingdie Xiong, Congcong Zhu et al.
Existing audio-driven visual dubbing methods have achieved great success. Despite this, we observe that the semantic ambiguity between spatial and temporal domains significantly degrades the synthesis stability for the dynamic faces. We argue that aligning the semantic features from spatial and temporal domains is a promising approach to stabilizing facial motion. To achieve this, we propose a Spatial-Temporal Semantic Alignment (STSA) method, which introduces a dual-path alignment mechanism and a differentiable semantic representation. The former leverages a Consistent Information Learning (CIL) module to maximize the mutual information at multiple scales, thereby reducing the manifold differences between spatial and temporal domains. The latter utilizes probabilistic heatmap as ambiguity-tolerant guidance to avoid the abnormal dynamics of the synthesized faces caused by slight semantic jittering. Extensive experimental results demonstrate the superiority of the proposed STSA, especially in terms of image quality and synthesis stability. Pre-trained weights and inference code are available at https://github.com/SCAILab-USTC/STSA.
11.5COMP-PHApr 17
A Structure-Preserving Graph Neural Solver for Parametric Hyperbolic Conservation LawsJiamin Jiang, Shanglin Lv, Jingrun Chen
Hyperbolic conservation laws govern a wide range of transport-driven dynamics featuring shocks, contact discontinuities, and complex wave interactions, posing distinct challenges for deep-learning-based surrogate modeling. While classical numerical methods provide robust and physically admissible solutions, their computational cost restricts applicability in many-query tasks such as parametric studies and design optimization. Conversely, existing neural surrogates offer rapid inference but often fail to respect intrinsic PDE structures, leading to non-physical artifacts, rollout instability, and poor generalization. We present an interpretable, structure-preserving graph neural solver that bridges classical numerical principles with graph neural networks (GNNs). The network is designed as a learned reconstruction-and-flux operator rather than a black-box state updater, thereby inherently preserving key properties such as local conservation and upwinding. Inspired by Arbitrary high-order DERivatives schemes, we further recast message-passing GNNs as high-order space-time predictors, enabling conservative and stable neural updates with large time steps. Evaluation is performed on challenging supersonic flow benchmarks spanning broad parametric variations in geometry, initial/boundary conditions, and flow regimes. The neural solver achieves superior long-horizon rollout stability and accuracy compared with strong surrogate baselines, outperforms low-order discretizations, and delivers orders-of-magnitude runtime speedups over high-resolution simulations.
CVNov 9, 2025
Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material FieldHaoqin Hong, Ding Fan, Fubin Dou et al.
Recently, 3D Gaussian Splatting (3DGS), an explicit scene representation technique, has shown significant promise for dynamic novel-view synthesis from monocular video input. However, purely data-driven 3DGS often struggles to capture the diverse physics-driven motion patterns in dynamic scenes. To fill this gap, we propose Physics-Informed Deformable Gaussian Splatting (PIDG), which treats each Gaussian particle as a Lagrangian material point with time-varying constitutive parameters and is supervised by 2D optical flow via motion projection. Specifically, we adopt static-dynamic decoupled 4D decomposed hash encoding to reconstruct geometry and motion efficiently. Subsequently, we impose the Cauchy momentum residual as a physics constraint, enabling independent prediction of each particle's velocity and constitutive stress via a time-evolving material field. Finally, we further supervise data fitting by matching Lagrangian particle flow to camera-compensated optical flow, which accelerates convergence and improves generalization. Experiments on a custom physics-driven dataset as well as on standard synthetic and real-world datasets demonstrate significant gains in physical consistency and monocular dynamic reconstruction quality.
CVFeb 20
OODBench: Out-of-Distribution Benchmark for Large Vision-Language ModelsLing Lin, Yang Bai, Heng Su et al.
Existing Visual-Language Models (VLMs) have achieved significant progress by being trained on massive-scale datasets, typically under the assumption that data are independent and identically distributed (IID). However, in real-world scenarios, it is often impractical to expect that all data processed by an AI system satisfy this assumption. Furthermore, failure to appropriately handle out-of-distribution (OOD) objects may introduce safety risks in real-world applications (e.g., autonomous driving or medical assistance). Unfortunately, current research has not yet provided valid benchmarks that can comprehensively assess the performance of VLMs in response to OOD data. Therefore, we propose OODBench, a predominantly automated method with minimal human verification, for constructing new benchmarks and evaluating the ability of VLMs to process OOD data. OODBench contains 40K instance-level OOD instance-category pairs, and we show that current VLMs still exhibit notable performance degradation on OODBench, even when the underlying image categories are common. In addition, we propose a reliable automated assessment metric that employs a Basic-to-Advanced Progression of prompted questions to assess the impact of OOD data on questions of varying difficulty more fully. Lastly, we summarize substantial findings and insights to facilitate future research in the acquisition and evaluation of OOD data.
IRMay 28, 2025
Xinyu AI Search: Enhanced Relevance and Comprehensive Results with Rich Answer PresentationsBo Tang, Junyi Zhu, Chenyang Xi et al.
Traditional search engines struggle to synthesize fragmented information for complex queries, while generative AI search engines face challenges in relevance, comprehensiveness, and presentation. To address these limitations, we introduce Xinyu AI Search, a novel system that incorporates a query-decomposition graph to dynamically break down complex queries into sub-queries, enabling stepwise retrieval and generation. Our retrieval pipeline enhances diversity through multi-source aggregation and query expansion, while filtering and re-ranking strategies optimize passage relevance. Additionally, Xinyu AI Search introduces a novel approach for fine-grained, precise built-in citation and innovates in result presentation by integrating timeline visualization and textual-visual choreography. Evaluated on recent real-world queries, Xinyu AI Search outperforms eight existing technologies in human assessments, excelling in relevance, comprehensiveness, and insightfulness. Ablation studies validate the necessity of its key sub-modules. Our work presents the first comprehensive framework for generative AI search engines, bridging retrieval, generation, and user-centric presentation.
NASep 1, 2023
Solving multiscale elliptic problems by sparse radial basis function neural networksZhiwen Wang, Minxin Chen, Jingrun Chen
Machine learning has been successfully applied to various fields of scientific computing in recent years. In this work, we propose a sparse radial basis function neural network method to solve elliptic partial differential equations (PDEs) with multiscale coefficients. Inspired by the deep mixed residual method, we rewrite the second-order problem into a first-order system and employ multiple radial basis function neural networks (RBFNNs) to approximate unknown functions in the system. To aviod the overfitting due to the simplicity of RBFNN, an additional regularization is introduced in the loss function. Thus the loss function contains two parts: the $L_2$ loss for the residual of the first-order system and boundary conditions, and the $\ell_1$ regularization term for the weights of radial basis functions (RBFs). An algorithm for optimizing the specific loss function is introduced to accelerate the training process. The accuracy and effectiveness of the proposed method are demonstrated through a collection of multiscale problems with scale separation, discontinuity and multiple scales from one to three dimensions. Notably, the $\ell_1$ regularization can achieve the goal of representing the solution by fewer RBFs. As a consequence, the total number of RBFs scales like $\mathcal{O}(\varepsilon^{-nτ})$, where $\varepsilon$ is the smallest scale, $n$ is the dimensionality, and $τ$ is typically smaller than $1$. It is worth mentioning that the proposed method not only has the numerical convergence and thus provides a reliable numerical solution in three dimensions when a classical method is typically not affordable, but also outperforms most other available machine learning methods in terms of accuracy and robustness.
NANov 5, 2019
Quasi-Monte Carlo sampling for machine-learning partial differential equationsJingrun Chen, Rui Du, Panchi Li et al.
Solving partial differential equations in high dimensions by deep neural network has brought significant attentions in recent years. In many scenarios, the loss function is defined as an integral over a high-dimensional domain. Monte-Carlo method, together with the deep neural network, is used to overcome the curse of dimensionality, while classical methods fail. Often, a deep neural network outperforms classical numerical methods in terms of both accuracy and efficiency. In this paper, we propose to use quasi-Monte Carlo sampling, instead of Monte-Carlo method to approximate the loss function. To demonstrate the idea, we conduct numerical experiments in the framework of deep Ritz method proposed by Weinan E and Bing Yu. For the same accuracy requirement, it is observed that quasi-Monte Carlo sampling reduces the size of training data set by more than two orders of magnitude compared to that of MC method. Under some assumptions, we prove that quasi-Monte Carlo sampling together with the deep neural network generates a convergent series with rate proportional to the approximation accuracy of quasi-Monte Carlo method for numerical integration. Numerically the fitted convergence rate is a bit smaller, but the proposed approach always outperforms Monte Carlo method. It is worth mentioning that the convergence analysis is generic whenever a loss function is approximated by the quasi-Monte Carlo method, although observations here are based on deep Ritz method.
COMP-PHOct 31, 2019
A QMC-deep learning method for diffusivity estimation in random domainsLiyao Lyu, Zhiwen Zhang, Jingrun Chen
Exciton diffusion plays a vital role in the function of many organic semiconducting opto-electronic devices, where an accurate description requires precise control of heterojunctions. This poses a challenging problem because the parameterization of heterojunctions in high-dimensional random space is far beyond the capability of classical simulation tools. Here, we develop a novel method based on quasi-Monte Carlo sampling to generate the training data set and deep neural network to extract a function for exciton diffusion length on surface roughness with high accuracy and unprecedented efficiency, yielding an abundance of information over the entire parameter space. Our method provides a new strategy to analyze the impact of interfacial ordering on exciton diffusion and is expected to assist experimental design with tailored opto-electronic functionalities.
NAMay 8, 2015
An atomistic/continuum coupling method using enriched basesJingrun Chen, Carlos J. García-Cervera, Xiantao Li
A common observation from an atomistic to continuum coupling method is that the error is often generated and concentrated near the interface, where the two models are combined. In this paper, a new method is proposed to suppress the error at the interface, and as a consequence, the overall accuracy is improved. The method is motivated by formulating the molecular mechanics model as a two-stage minimization problem. In particular, it is demonstrated that the error at the interface can be considerably reduced when new basis functions are introduced in a Galerkin projection formalism. The improvement of the accuracy is illustrated by two examples. Further, the comparison to some quasicontinuum-type methods is provided.
NAApr 6, 2015
Analysis of the divide-and-conquer method for electronic structure calculationsJingrun Chen, Jianfeng Lu
We study the accuracy of the divide-and-conquer method for electronic structure calculations. The analysis is conducted for a prototypical subdomain problem in the method. We prove that the pointwise difference between electron densities of the global system and the subsystem decays exponentially as a function of the distance away from the boundary of the subsystem, under the gap assumption of both the global system and the subsystem. We show that gap assumption is crucial for the accuracy of the divide-and-conquer method by numerical examples. In particular, we show examples with the loss of accuracy when the gap assumption of the subsystem is invalid.