NAFeb 22, 2019
A linearized energy--conservative finite element method for the nonlinear Schrödinger equation with wave operatorWentao Cai, Dongdong He, Kejia Pan
In this paper, we propose a linearized finite element method (FEM) for solving the cubic nonlinear Schrödinger equation with wave operator. In this method, a modified leap-frog scheme is applied for time discretization and a Galerkin finite element method is applied for spatial discretization. We prove that the proposed method keeps the energy conservation in the given discrete norm. Comparing with non-conservative schemes, our algorithm keeps higher stability. Meanwhile, an optimal error estimate for the proposed scheme is given by an error splitting technique. That is, we split the error into two parts, one from temporal discretization and the other from spatial discretization. First, by introducing a time-discrete system, we prove the uniform boundedness for the solution of this time-discrete system in some strong norms and obtain error estimates in temporal direction. With the help of the preliminary temporal estimates, we then prove the pointwise uniform boundedness of the finite element solution, and obtain the optimal $L^2$-norm error estimates in the sense that the time step size is not related to spatial mesh size. Finally, numerical examples are provided to validate the convergence-order, unconditional stability and energy conservation.
NAApr 19, 2018
A fourth-order maximum principle preserving operator splitting scheme for three-dimensional fractional Allen-Cahn equationsDongdong He, Kejia Pan, Hongling Hu
In this paper, by using Strang's second-order splitting method, the numerical procedure for the three-dimensional (3D) space fractional Allen-Cahn equation can be divided into three steps. The first and third steps involve an ordinary differential equation, which can be solved analytically. The intermediate step involves a 3D linear fractional diffusion equation, which is solved by the Crank-Nicolson alternating directional implicit (ADI) method. The ADI technique can convert the multidimensional problem into a series of one-dimensional problems, which greatly reduces the computational cost. A fourth-order difference scheme is adopted for discretization of the space fractional derivatives. Finally, Richardson extrapolation is exploited to increase the temporal accuracy. The proposed method is shown to be unconditionally stable by Fourier analysis. Another contribution of this paper is to show that the numerical solutions satisfy the discrete maximum principle under reasonable time step constraint. For fabricated smooth solutions, numerical results show that the proposed method is unconditionally stable and fourth-order accurate in both time and space variables. In addition, the discrete maximum principle is also numerically verified.
NAMay 31, 2018
A three-level linearized difference scheme for the coupled nonlinear fractional Ginzburg-Landau equationDongdong He, Kejia Pan
In this paper, the coupled fractional Ginzburg-Landau equations are first time investigated numerically. A linearized implicit finite difference scheme is proposed. The scheme involves three time levels, is unconditionally stable and second-order accurate in both time and space variables. The unique solvability, the unconditional stability and optimal pointwise error estimates are obtained by using the energy method and mathematical induction. Moreover, the proposed second-order method can be easily extended into the fourth-order method by using an average finite difference operator for spatial fractional derivatives and Richardson extrapolation for time variable. Finally, numerical results are presented to confirm the theoretical results.
CEDec 5, 2017
Wave analysis in one dimensional structures with a wavelet finite element model and precise integration methodShuaifang Zhang, Dongdong He, Dongsheng Li et al.
Numerical simulation of ultrasonic wave propagation provides an efficient tool for crack identification in structures, while it requires a high resolution and expensive time calculation cost in both time integration and spatial discretization. Wavelet finite element model provides a highorder finite element model and gives a higher accuracy on spatial discretization, B-Spline wavelet interval (BSWI) has been proved to be one of the most commonly used wavelet finite element model with the advantage of getting the same accuracy but with fewer element so that the calculation cost is much lower than traditional finite element method and other high-order element methods. Precise Integration Method provides a higher resolution in time integration and has been proved to be a stable time integration method with a much lower cut-off error for same and even smaller time step. In this paper, a wavelet finite element model combined with precise integration method is presented for the numerical simulation of ultrasonic wave propagation and crack identification in 1D structures. Firstly, the wavelet finite element based on BSWI is constructed for rod and beam structures. Then Precise Integrated Method is introduced with application for the wave propagation in 1D structures. Finally, numerical examples of ultrasonic wave propagation in rod and beam structures are conducted for verification. Moreover, crack identification in both rod and beam structures are studied based on the new model.
CLDec 30, 2025
Training Report of TeleChat3-MoEXinzhang Liu, Chao Wang, Zhihao Yang et al.
TeleChat3-MoE is the latest series of TeleChat large language models, featuring a Mixture-of-Experts (MoE) architecture with parameter counts ranging from 105 billion to over one trillion,trained end-to-end on Ascend NPU cluster. This technical report mainly presents the underlying training infrastructure that enables reliable and efficient scaling to frontier model sizes. We detail systematic methodologies for operator-level and end-to-end numerical accuracy verification, ensuring consistency across hardware platforms and distributed parallelism strategies. Furthermore, we introduce a suite of performance optimizations, including interleaved pipeline scheduling, attention-aware data scheduling for long-sequence training,hierarchical and overlapped communication for expert parallelism, and DVM-based operator fusion. A systematic parallelization framework, leveraging analytical estimation and integer linear programming, is also proposed to optimize multi-dimensional parallelism configurations. Additionally, we present methodological approaches to cluster-level optimizations, addressing host- and device-bound bottlenecks during large-scale training tasks. These infrastructure advancements yield significant throughput improvements and near-linear scaling on clusters comprising thousands of devices, providing a robust foundation for large-scale language model development on hardware ecosystems.
CVMay 13, 2025
Monocular Depth Guided Occlusion-Aware Disparity Refinement via Semi-supervised Learning in Laparoscopic ImagesZiteng Liu, Dongdong He, Chenghong Zhang et al.
Occlusion and the scarcity of labeled surgical data are significant challenges in disparity estimation for stereo laparoscopic images. To address these issues, this study proposes a Depth Guided Occlusion-Aware Disparity Refinement Network (DGORNet), which refines disparity maps by leveraging monocular depth information unaffected by occlusion. A Position Embedding (PE) module is introduced to provide explicit spatial context, enhancing the network's ability to localize and refine features. Furthermore, we introduce an Optical Flow Difference Loss (OFDLoss) for unlabeled data, leveraging temporal continuity across video frames to improve robustness in dynamic surgical scenes. Experiments on the SCARED dataset demonstrate that DGORNet outperforms state-of-the-art methods in terms of End-Point Error (EPE) and Root Mean Squared Error (RMSE), particularly in occlusion and texture-less regions. Ablation studies confirm the contributions of the Position Embedding and Optical Flow Difference Loss, highlighting their roles in improving spatial and temporal consistency. These results underscore DGORNet's effectiveness in enhancing disparity estimation for laparoscopic surgery, offering a practical solution to challenges in disparity estimation and data limitations.
NAAug 31, 2016
An extrapolation cascadic multigrid method combined with a fourth order compact scheme for 3D poisson equationKejia Pan, Dongdong He, Hongling Hu
In this paper, we develop an EXCMG method to solve the three-dimensional Poisson equation on rectangular domains by using the compact finite difference (FD) method with unequal meshsizes in different coordinate directions. The resulting linear system from compact FD discretization is solved by the conjugate gradient (CG) method with a relative residual stopping criterion. By combining the Richardson extrapolation and tri-quartic Lagrange interpolation for the numerical solutions from two-level of grids (current and previous grids), we are able to produce an extremely accurate approximation of the actual numerical solution on the next finer grid, which can greatly reduce the number of relaxation sweeps needed. Additionally, a simple method based on the midpoint extrapolation formula is used for the fourth-order FD solutions on two-level of grids to achieve sixth-order accuracy on the entire fine grid cheaply and directly. The gradient of the numerical solution can also be easily obtained through solving a series of tridiagonal linear systems resulting from the fourth-order compact FD discretizations. Numerical results show that our EXCMG method is much more efficient than the classical V-cycle and W-cycle multigrid methods. Moreover, only few CG iterations are required on the finest grid to achieve full fourth-order accuracy in both the $L^2$-norm and $L^{\infty}$-norm for the solution and its gradient when the exact solution belongs to $C^6$. Finally, numerical result shows that our EXCMG method is still effective when the exact solution has a lower regularity, which widens the scope of applicability of our EXCMG method.
NASep 15, 2015
A linearly implicit conservative difference scheme for the generalized Rosenau-Kawahara-RLW equationDongdong He, Kejia Pan
This paper concerns the numerical study for the generalized Rosenau-Kawahara-RLW equation obtained by coupling the generalized Rosenau-RLW equation and the generalized Rosenau-Kawahara equation. We first derive the energy conservation law of the equation, and then develop a three-level linearly implicit difference scheme for solving the equation. We prove that the proposed scheme is energy-conserved, unconditionally stable and second-order accurate both in time and space variables. Finally, numerical experiments are carried out to confirm the energy conservation, the convergence rates of the scheme and effectiveness for long-time simulation.
NAJun 9, 2015
A new extrapolation cascadic multigrid method for 3D elliptic boundary value problems on rectangular domainsKejia Pan, Dongdong He, Hongling Hu
In this paper, we develop a new extrapolation cascadic multigrid (ECMG$_{jcg}$) method, which makes it possible to solve 3D elliptic boundary value problems on rectangular domains of over 100 million unknowns on a desktop computer in minutes. First, by combining Richardson extrapolation and tri-quadratic Serendipity interpolation techniques, we introduce a new extrapolation formula to provide a good initial guess for the iterative solution on the next finer grid, which is a third order approximation to the finite element (FE) solution. And the resulting large sparse linear system from the FE discretization is then solved by the Jacobi-preconditioned Conjugate Gradient (JCG) method. Additionally, instead of performing a fixed number of iterations as cascadic multigrid (CMG) methods, a relative residual stopping criterion is used in iterative solvers, which enables us to obtain conveniently the numerical solution with the desired accuracy. Moreover, a simple Richardson extrapolation is used to cheaply get a fourth order approximate solution on the entire fine grid. Test results are reported to show that ECMG$_{jcg}$ has much better efficiency compared to the classical MG methods. Since the initial guess for the iterative solution is a quite good approximation to the FE solution, numerical results show that only few number of iterations are required on the finest grid for ECMG$_{jcg}$ with an appropriate tolerance of the relative residual to achieve full second order accuracy, which is particularly important when solving large systems of equations and can greatly reduce the computational cost. It should be pointed out that when the tolerance becomes smaller, ECMG$_{jcg}$ still needs only few iterations to obtain fourth order extrapolated solution on each grid, except on the finest grid. Finally, we present the reason why our ECMG algorithms are so highly efficient for solving such problems.
NAMay 31, 2015
An energy preserving finite difference scheme for the Poisson-Nernst-Planck systemDongdong He, Kejia Pan
In this paper, we construct a semi-implicit finite difference method for the time dependent Poisson-Nernst-Planck system. Although the Poisson-Nernst-Planck system is a nonlinear system, the numerical method presented in this paper only needs to solve a linear system at each time step, which can be done very efficiently. The rigorous proof for the mass conservation and electric potential energy decay are shown. Moreover, mesh refinement analysis shows that the method is second order convergent in space and first order convergent in time. Finally we point out that our method can be easily extended to the case of multi-ions.