Haijun Yu

NA
h-index25
25papers
164citations
Novelty52%
AI Score55

25 Papers

NAJan 17, 2017
Numerical Approximations for a Phase-Field Moving Contact Line Model with Variable Densities and Viscosities

Haijun Yu, Xiaofeng Yang

We consider the numerical approximations of a two-phase hydrodynamics coupled phase-field model that incorporates the variable densities, viscosities and moving contact line boundary conditions. The model is a nonlinear, coupled system that consists of incompressible Navier--Stokes equations with the generalized Navier boundary condition, and the Cahn--Hilliard equations with moving contact line boundary conditions. By some subtle explicit--implicit treatments to nonlinear terms, we develop two efficient, unconditionally energy stable numerical schemes, in particular, a linear decoupled energy stable scheme for the system with static contact line condition, and a nonlinear energy stable scheme for the system with dynamic contact line condition. An efficient spectral-Galerkin spatial discretization is implemented to verify the accuracy and efficiency of proposed schemes. Various numerical results show that the proposed schemes are efficient and accurate.

SOFTAug 8, 2023
Constructing Custom Thermodynamics Using Deep Learning

Xiaoli Chen, Beatrice W. Soh, Zi-En Ooi et al.

One of the most exciting applications of artificial intelligence (AI) is automated scientific discovery based on previously amassed data, coupled with restrictions provided by known physical principles, including symmetries and conservation laws. Such automated hypothesis creation and verification can assist scientists in studying complex phenomena, where traditional physical intuition may fail. Here we develop a platform based on a generalized Onsager principle to learn macroscopic dynamical descriptions of arbitrary stochastic dissipative systems directly from observations of their microscopic trajectories. Our method simultaneously constructs reduced thermodynamic coordinates and interprets the dynamics on these coordinates. We demonstrate its effectiveness by studying theoretically and validating experimentally the stretching of long polymer chains in an externally applied field. Specifically, we learn three interpretable thermodynamic coordinates and build a dynamical landscape of polymer stretching, including the identification of stable and transition states and the control of the stretching rate. Our general methodology can be used to address a wide range of scientific and technological applications.

NAMay 22, 2018
On Efficient Second Order Stabilized Semi-Implicit Schemes for the Cahn-Hilliard Phase-Field Equation

Lin Wang, Haijun Yu

Efficient and energy stable high order time marching schemes are very important but not easy to construct for the study of nonlinear phase dynamics. In this paper, we propose and study two linearly stabilized second order semi-implicit schemes for the Cahn-Hilliard phase-field equation. One uses backward differentiation formula and the other uses Crank-Nicolson method to discretize linear terms. In both schemes, the nonlinear bulk forces are treated explicitly with two second-order stabilization terms. This treatment leads to linear elliptic systems with constant coefficients, for which lots of robust and efficient solvers are available. The discrete energy dissipation properties are proved for both schemes. Rigorous error analysis is carried out to show that, when the time step-size is small enough, second order accuracy in time is obtained with a prefactor controlled by a fixed power of $1/\varepsilon$, where $\varepsilon$ is the characteristic interface thickness. Numerical results are presented to verify the accuracy and efficiency of proposed schemes.

NAOct 9, 2017
Convergence Analysis of an Unconditionally Energy Stable Linear Crank-Nicolson Scheme for the Cahn-Hilliard Equation

Lin Wang, Haijun Yu

Efficient and unconditionally stable high order time marching schemes are very important but not easy to construct for nonlinear phase dynamics. In this paper, we propose and analysis an efficient stabilized linear Crank-Nicolson scheme for the Cahn-Hilliard equation with provable unconditional stability. In this scheme the nonlinear bulk force are treated explicitly with two second-order linear stabilization terms. The semi-discretized equation is a linear elliptic system with constant coefficients, thus robust and efficient solution procedures are guaranteed. Rigorous error analysis show that, when the time step-size is small enough, the scheme is second order accurate in time with aprefactor controlled by some lower degree polynomial of $1/\varepsilon$. Here $\varepsilon$ is the interface thickness parameter. Numerical results are presented to verify the accuracy and efficiency of the scheme.

NAMar 20, 2018
Convergence analysis of a finite element approximation of minimum action methods

Xiaoliang Wan, Haijun Yu, Jiayu Zhai

In this work, we address the convergence of a finite element approximation of the minimizer of the Freidlin-Wentzell (F-W) action functional for non-gradient dynamical systems perturbed by small noise. The F-W theory of large deviations is a rigorous mathematical tool to study small-noise-induced transitions in a dynamical system. The central task in the application of F-W theory of large deviations is to seek the minimizer and minimum of the F-W action functional. We discretize the F-W action functional using linear finite elements, and establish the convergence of {the approximation} through $Γ$-convergence.

73.0NAMay 26
Predictive Moving Sample Method for Physics-Informed Neural Solvers of Time-Dependent PDEs

Beining Xu, Bocheng Zhang, Haijun Yu et al.

Time-dependent partial differential equations (PDEs) often develop sharp fronts, localized peaks, and other moving structures that occupy only a small portion of the space--time domain but dominate the approximation error. This makes fixed or uniformly sampled collocation strategies inefficient for physics-informed neural networks (PINNs), especially in high dimensions and over long-time prediction intervals. We propose the predictive moving sample method (PMSM), which builds on the moving sample method (MSM) in \cite{xu2026moving} by replacing its full time domain iterative training with a progressive time-stepping strategy and simplifying the velocity-field loss to further reduce the per-step cost. To improve practicality for long-time prediction, we further introduce the windowed-reset predictive moving sample method (WR-PMSM), which restricts extension training to an active time window and periodically resets the reference state, thereby reducing the growth of optimization cost while preserving global consistency through a final refinement stage. Across four representative benchmarks, PMSM consistently outperforms both standard PINNs and the original MSM under matched collocation budgets. These results suggest that transporting samples according to residual dynamics provides an effective and practical route to neural network solvers for time-dependent PDEs.

NAOct 31, 2018
Energy Stable Second Order Linear Schemes for the Allen-Cahn Phase-Field Equation

Lin Wang, Haijun Yu

Phase-field model is a powerful mathematical tool to study the dynamics of interface and morphology changes in fluid mechanics and material sciences. However, numerically solving a phase field model for a real problem is a challenge task due to the non-convexity of the bulk energy and the small interface thickness parameter in the equation. In this paper, we propose two stabilized second order semi-implicit linear schemes for the Allen-Cahn phase-field equation based on backward differentiation formula and Crank-Nicolson method, respectively. In both schemes, the nonlinear bulk force is treated explicitly with two second-order stabilization terms, which make the schemes unconditional energy stable and numerically efficient. By using a known result of the spectrum estimate of the linearized Allen-Cahn operator and some regularity estimate of the exact solution, we obtain an optimal second order convergence in time with a prefactor depending on the inverse of the characteristic interface thickness only in some lower polynomial order. Both 2-dimensional and 3-dimensional numerical results are presented to verify the accuracy and efficiency of proposed schemes.

NAFeb 25, 2019
Numerical approximation of elliptic problems with log-normal random coefficients

Xiaoliang Wan, Haijun Yu

In this work, we consider a non-standard preconditioning strategy for the numerical approximation of the classical elliptic equations with log-normal random coefficients. In \cite{Wan_model}, a Wick-type elliptic model was proposed by modeling the random flux through the Wick product. Due to the lower-triangular structure of the uncertainty propagator, this model can be approximated efficiently using the Wiener chaos expansion in the probability space. Such a Wick-type model provides, in general, a second-order approximation of the classical one in terms of the standard deviation of the underlying Gaussian process. Furthermore, when the correlation length of the underlying Gaussian process goes to infinity, the Wick-type model yields the same solution as the classical one. These observations imply that the Wick-type elliptic equation can provide an effective preconditioner for the classical random elliptic equation under appropriate conditions. We use the Wick-type elliptic model to accelerate the Monte Carlo method and the stochastic Galerkin finite element method. Numerical results are presented and discussed.

CVSep 21, 2023
OSNet & MNetO: Two Types of General Reconstruction Architectures for Linear Computed Tomography in Multi-Scenarios

Zhisheng Wang, Zihan Deng, Fenglin Liu et al.

Recently, linear computed tomography (LCT) systems have actively attracted attention. To weaken projection truncation and image the region of interest (ROI) for LCT, the backprojection filtration (BPF) algorithm is an effective solution. However, in BPF for LCT, it is difficult to achieve stable interior reconstruction, and for differentiated backprojection (DBP) images of LCT, multiple rotation-finite inversion of Hilbert transform (Hilbert filtering)-inverse rotation operations will blur the image. To satisfy multiple reconstruction scenarios for LCT, including interior ROI, complete object, and exterior region beyond field-of-view (FOV), and avoid the rotation operations of Hilbert filtering, we propose two types of reconstruction architectures. The first overlays multiple DBP images to obtain a complete DBP image, then uses a network to learn the overlying Hilbert filtering function, referred to as the Overlay-Single Network (OSNet). The second uses multiple networks to train different directional Hilbert filtering models for DBP images of multiple linear scannings, respectively, and then overlays the reconstructed results, i.e., Multiple Networks Overlaying (MNetO). In two architectures, we introduce a Swin Transformer (ST) block to the generator of pix2pixGAN to extract both local and global features from DBP images at the same time. We investigate two architectures from different networks, FOV sizes, pixel sizes, number of projections, geometric magnification, and processing time. Experimental results show that two architectures can both recover images. OSNet outperforms BPF in various scenarios. For the different networks, ST-pix2pixGAN is superior to pix2pixGAN and CycleGAN. MNetO exhibits a few artifacts due to the differences among the multiple models, but any one of its models is suitable for imaging the exterior edge in a certain direction.

NAOct 9, 2017
A Laguerre homotopy method for optimal control of nonlinear systems in semi-infinite interval

Haijun Yu, Hassan Saberi Nik

This paper presents a Laguerre homotopy method for optimal control problems in semi-infinite intervals (LaHOC), with particular interests given to nonlinear interconnected large-scale dynamic systems. In LaHOC, spectral homotopy analysis method is used to derive an iterative solver for the nonlinear two-point boundary value problem derived from Pontryagins maximum principle. A proof of local convergence of the LaHOC is provided. Numerical comparisons are made between the LaHOC, Matlab BVP5C generated results and results from literature for two nonlinear optimal control problems. The results show that LaHOC is superior in both accuracy and efficiency.

CVJan 28
Structure-constrained Language-informed Diffusion Model for Unpaired Low-dose Computed Tomography Angiography Reconstruction

Genyuan Zhang, Zihao Wang, Zhifan Gao et al.

The application of iodinated contrast media (ICM) improves the sensitivity and specificity of computed tomography (CT) for a wide range of clinical indications. However, overdose of ICM can cause problems such as kidney damage and life-threatening allergic reactions. Deep learning methods can generate CT images of normal-dose ICM from low-dose ICM, reducing the required dose while maintaining diagnostic power. However, existing methods are difficult to realize accurate enhancement with incompletely paired images, mainly because of the limited ability of the model to recognize specific structures. To overcome this limitation, we propose a Structure-constrained Language-informed Diffusion Model (SLDM), a unified medical generation model that integrates structural synergy and spatial intelligence. First, the structural prior information of the image is effectively extracted to constrain the model inference process, thus ensuring structural consistency in the enhancement process. Subsequently, semantic supervision strategy with spatial intelligence is introduced, which integrates the functions of visual perception and spatial reasoning, thus prompting the model to achieve accurate enhancement. Finally, the subtraction angiography enhancement module is applied, which serves to improve the contrast of the ICM agent region to suitable interval for observation. Qualitative analysis of visual comparison and quantitative results of several metrics demonstrate the effectiveness of our method in angiographic reconstruction for low-dose contrast medium CT angiography.

NAOct 29, 2025
Scaling Optimized Hermite Approximation Methods

Hao Hu, Haijun Yu

Hermite polynomials and functions have extensive applications in scientific and engineering problems. Although it is recognized that employing the scaled Hermite functions rather than the standard ones can remarkably enhance the approximation performance, the understanding of the scaling factor remains insufficient. Due to the lack of theoretical analysis, recent publications still cast doubt on whether the Hermite spectral method is inferior to other methods. To dispel this doubt, we show in this article that the inefficiency of the Hermite spectral method comes from the imbalance in the decay speed of the objective function within the spatial and frequency domains. Proper scaling can render the Hermite spectral methods comparable to other methods. To make it solid, we propose a novel error analysis framework for the scaled Hermite approximation. Taking the $L^2$ projection error as an example, our framework illustrates that there are three different components of errors: the spatial truncation error, the frequency truncation error, and the Hermite spectral approximation error. Through this perspective, finding the optimal scaling factor is equivalent to balancing the spatial and frequency truncation errors. As applications, we show that geometric convergence can be recovered by proper scaling for a class of functions. Furthermore, we show that proper scaling can double the convergence order for smooth functions with algebraic decay. The perplexing pre-asymptotic sub-geometric convergence when approximating algebraic decay functions can be perfectly explained by this framework.

NAJun 24, 2024
Energetic Spectral-Element Time Marching Methods for Phase-Field Nonlinear Gradient Systems

Shiqin Liu, Haijun Yu

We propose two efficient energetic spectral-element methods in time for marching nonlinear gradient systems with the phase-field Allen--Cahn equation as an example: one fully implicit nonlinear method and one semi-implicit linear method. Different from other spectral methods in time using spectral Petrov-Galerkin or weighted Galerkin approximations, the presented implicit method employs an energetic variational Galerkin form that can maintain the mass conservation and energy dissipation property of the continuous dynamical system. Another advantage of this method is its superconvergence. A high-order extrapolation is adopted for the nonlinear term to get the semi-implicit method. The semi-implicit method does not have superconvergence, but can be improved by a few Picard-like iterations to recover the superconvergence of the implicit method. Numerical experiments verify that the method using Legendre elements of degree three outperforms the 4th-order implicit-explicit backward differentiation formula and the 4th-order exponential time difference Runge-Kutta method, which were known to have best performances in solving phase-field equations. In addition to the standard Allen--Cahn equation, we also apply the method to a conservative Allen--Cahn equation, in which the conservation of discrete total mass is verified. The applications of the proposed methods are not limited to phase-field Allen--Cahn equations. They are suitable for solving general, large-scale nonlinear dynamical systems.

5.5CVApr 1
Improving Generalization of Deep Learning for Brain Metastases Segmentation Across Institutions

Yuchen Yang, Shuangyang Zhong, Haijun Yu et al.

Background: Deep learning has demonstrated significant potential for automated brain metastases (BM) segmentation; however, models trained at a singular institution often exhibit suboptimal performance at various sites due to disparities in scanner hardware, imaging protocols, and patient demographics. The goal of this work is to create a domain adaptation framework that will allow for BM segmentation to be used across multiple institutions. Methods: We propose a VAE-MMD preprocessing pipeline that combines variational autoencoders (VAE) with maximum mean discrepancy (MMD) loss, incorporating skip connections and self-attention mechanisms alongside nnU-Net segmentation. The method was tested on 740 patients from four public databases: Stanford, UCSF, UCLM, and PKG, evaluated by domain classifier's accuracy, sensitivity, precision, F1/F2 scores, surface Dice (sDice), and 95th percentile Hausdorff distance (HD95). Results: VAE-MMD reduced domain classifier accuracy from 0.91 to 0.50, indicating successful feature alignment across institutions. Reconstructed volumes attained a PSNR greater than 36 dB, maintaining anatomical accuracy. The combined method raised the mean F1 by 11.1% (0.700 to 0.778), the mean sDice by 7.93% (0.7121 to 0.7686), and reduced the mean HD95 by 65.5% (11.33 to 3.91 mm) across all four centers compared to the baseline nnU-Net. Conclusions: VAE-MMD effectively diminishes cross-institutional data heterogeneity and enhances BM segmentation generalization across volumetric, detection, and boundary-level metrics without necessitating target-domain labels, thereby overcoming a significant obstacle to the clinical implementation of AI-assisted segmentation.

NANov 15, 2024
A Natural Deep Ritz Method for Essential Boundary Value Problems

Haijun Yu, Shuo Zhang

Deep neural network approaches show promise in solving partial differential equations. However, unlike traditional numerical methods, they face challenges in enforcing essential boundary conditions. The widely adopted penalty-type methods, for example, offer a straightforward implementation but introduces additional complexity due to the need for hyper-parameter tuning; moreover, the use of a large penalty parameter can lead to artificial extra stiffness, complicating the optimization process. In this paper, we propose a novel, intrinsic approach to impose essential boundary conditions through a framework inspired by intrinsic structures. We demonstrate the effectiveness of this approach using the deep Ritz method applied to Poisson problems, with the potential for extension to more general equations and other deep learning techniques. Numerical results are provided to substantiate the efficiency and robustness of the proposed method.

IVJan 17, 2024
A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

Mareike Thies, Fabian Wagner, Noah Maul et al.

Cone-beam computed tomography (CBCT) systems, with their flexibility, present a promising avenue for direct point-of-care medical imaging, particularly in critical scenarios such as acute stroke assessment. However, the integration of CBCT into clinical workflows faces challenges, primarily linked to long scan duration resulting in patient motion during scanning and leading to image quality degradation in the reconstructed volumes. This paper introduces a novel approach to CBCT motion estimation using a gradient-based optimization algorithm, which leverages generalized derivatives of the backprojection operator for cone-beam CT geometries. Building on that, a fully differentiable target function is formulated which grades the quality of the current motion estimate in reconstruction space. We drastically accelerate motion estimation yielding a 19-fold speed-up compared to existing methods. Additionally, we investigate the architecture of networks used for quality metric regression and propose predicting voxel-wise quality maps, favoring autoencoder-like architectures over contracting ones. This modification improves gradient flow, leading to more accurate motion estimation. The presented method is evaluated through realistic experiments on head anatomy. It achieves a reduction in reprojection error from an initial average of 3mm to 0.61mm after motion compensation and consistently demonstrates superior performance compared to existing approaches. The analytic Jacobian for the backprojection operation, which is at the core of the proposed method, is made publicly available. In summary, this paper contributes to the advancement of CBCT integration into clinical workflows by proposing a robust motion estimation approach that enhances efficiency and accuracy, addressing critical challenges in time-sensitive scenarios.

IVAug 15, 2025
Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension

Zhenhao Li, Long Yang, Xiaojie Yin et al.

Computed tomography (CT) is a cornerstone imaging modality for non-invasive, high-resolution visualization of internal anatomical structures. However, when the scanned object exceeds the scanner's field of view (FOV), projection data are truncated, resulting in incomplete reconstructions and pronounced artifacts near FOV boundaries. Conventional reconstruction algorithms struggle to recover accurate anatomy from such data, limiting clinical reliability. Deep learning approaches have been explored for FOV extension, with diffusion generative models representing the latest advances in image synthesis. Yet, conventional diffusion models are computationally demanding and slow at inference due to their iterative sampling process. To address these limitations, we propose an efficient CT FOV extension framework based on the image-to-image Schrödinger Bridge (I$^2$SB) diffusion model. Unlike traditional diffusion models that synthesize images from pure Gaussian noise, I$^2$SB learns a direct stochastic mapping between paired limited-FOV and extended-FOV images. This direct correspondence yields a more interpretable and traceable generative process, enhancing anatomical consistency and structural fidelity in reconstructions. I$^2$SB achieves superior quantitative performance, with root-mean-square error (RMSE) values of 49.8\,HU on simulated noisy data and 152.0HU on real data, outperforming state-of-the-art diffusion models such as conditional denoising diffusion probabilistic models (cDDPM) and patch-based diffusion methods. Moreover, its one-step inference enables reconstruction in just 0.19s per 2D slice, representing over a 700-fold speedup compared to cDDPM (135s) and surpassing diffusionGAN (0.58s), the second fastest. This combination of accuracy and efficiency makes I$^2$SB highly suitable for real-time or clinical deployment.

CVMay 30, 2023
BPF Algorithms for Multiple Source-Translation Computed Tomography Reconstruction

Zhisheng Wang, Haijun Yu, Yixing Huang et al.

Micro-computed tomography (micro-CT) is a widely used state-of-the-art instrument employed to study the morphological structures of objects in various fields. However, its small field-of-view (FOV) cannot meet the pressing demand for imaging relatively large objects at high spatial resolutions. Recently, we devised a novel scanning mode called multiple source translation CT (mSTCT) that effectively enlarges the FOV of the micro-CT and correspondingly developed a virtual projection-based filtered backprojection (V-FBP) algorithm for reconstruction. Although V-FBP skillfully solves the truncation problem in mSTCT, it requires densely sampled projections to arrive at high-resolution reconstruction, which reduces imaging efficiency. In this paper, we developed two backprojection-filtration (BPF)-based algorithms for mSTCT, i.e., S-BPF (derivatives along source) and D-BPF (derivatives along detector). D-BPF can achieve high-resolution reconstruction with fewer projections than V-FBP and S-BPF. Through simulated and real experiments conducted in this paper, we demonstrate that D-BPF can reduce source sampling by 75% compared with V-FBP at the same spatial resolution, which makes mSTCT more feasible in practice. Meanwhile, S-BPF can yield more stable results than D-BPF, which is similar to V-FBP.

NADec 26, 2022
Improved Laguerre Spectral Methods with Less Round-off Errors and Better Stability

Shenghe Huang, Haijun Yu

Laguerre polynomials are orthogonal polynomials defined on positive half line with respect to weight $e^{-x}$. They have wide applications in scientific and engineering computations. However, the exponential growth of Laguerre polynomials of high degree makes it hard to apply them to complicated systems that need to use large numbers of Laguerre bases. In this paper, we introduce modified three-term recurrence formula to reduce the round-off error and to avoid overflow and underflow issues in generating generalized Laguerre polynomials and Laguerre functions. We apply the improved Laguerre methods to solve an elliptic equation defined on the half line. More than one thousand Laguerre bases are used in this application and meanwhile accuracy close to machine precision is achieved. The optimal scaling factor of Laguerre methods are studied and found to be independent of number of quadrature points in two cases that Laguerre methods have better convergence speeds than mapped Jacobi methods.

DSSep 6, 2020
OnsagerNet: Learning Stable and Interpretable Dynamics using a Generalized Onsager Principle

Haijun Yu, Xinyuan Tian, Weinan E et al.

We propose a systematic method for learning stable and physically interpretable dynamical models using sampled trajectory data from physical processes based on a generalized Onsager principle. The learned dynamics are autonomous ordinary differential equations parameterized by neural networks that retain clear physical structure information, such as free energy, diffusion, conservative motion and external forces. For high dimensional problems with a low dimensional slow manifold, an autoencoder with metric preserving regularization is introduced to find the low dimensional generalized coordinates on which we learn the generalized Onsager dynamics. Our method exhibits clear advantages over existing methods on benchmark problems for learning ordinary differential equations. We further apply this method to study Rayleigh-Benard convection and learn Lorenz-like low dimensional autonomous reduced order models that capture both qualitative and quantitative properties of the underlying dynamics. This forms a general approach to building reduced order models for forced dissipative systems.

LGNov 7, 2019
ChebNet: Efficient and Stable Constructions of Deep Neural Networks with Rectified Power Units via Chebyshev Approximations

Shanshan Tang, Bo Li, Haijun Yu

In a previous study [B. Li, S. Tang and H. Yu, Commun. Comput. Phy. 27(2):379-411, 2020], it is shown that deep neural networks built with rectified power units (RePU) as activation functions can give better approximation for sufficient smooth functions than those built with rectified linear units, by converting polynomial approximations using power series into deep neural networks with optimal complexity and no approximation error. However, in practice, power series approximations are not easy to obtain due to the associated stability issue. In this paper, we propose a new and more stable way to construct RePU deep neural networks based on Chebyshev polynomial approximations. By using a hierarchical structure of Chebyshev polynomial approximation in frequency domain, we obtain efficient and stable deep neural network construction, which we call ChebNet. The approximation of smooth functions by ChebNets is no worse than the approximation by deep RePU nets using power series. On the same time, ChebNets are much more stable. Numerical results show that the constructed ChebNets can be further fine-tuned to obtain much better results than those obtained by tuning deep RePU nets constructed by power series approach. As spectral accuracy is hard to obtain by direct training of deep neural networks, ChebNets provide a practical way to obtain spectral accuracy, it is expected to be useful in real applications that require efficient approximations of smooth functions.

LGSep 9, 2019
PowerNet: Efficient Representations of Polynomials and Smooth Functions by Deep Neural Networks with Rectified Power Units

Bo Li, Shanshan Tang, Haijun Yu

Deep neural network with rectified linear units (ReLU) is getting more and more popular recently. However, the derivatives of the function represented by a ReLU network are not continuous, which limit the usage of ReLU network to situations only when smoothness is not required. In this paper, we construct deep neural networks with rectified power units (RePU), which can give better approximations for smooth functions. Optimal algorithms are proposed to explicitly build neural networks with sparsely connected RePUs, which we call PowerNets, to represent polynomials with no approximation error. For general smooth functions, we first project the function to their polynomial approximations, then use the proposed algorithms to construct corresponding PowerNets. Thus, the error of best polynomial approximation provides an upper bound of the best RePU network approximation error. For smooth functions in higher dimensional Sobolev spaces, we use fast spectral transforms for tensor-product grid and sparse grid discretization to get polynomial approximations. Our constructive algorithms show clearly a close connection between spectral methods and deep neural networks: a PowerNet with $n$ layers can exactly represent polynomials up to degree $s^n$, where $s$ is the power of RePUs. The proposed PowerNets have potential applications in the situations where high-accuracy is desired or smoothness is required.

IVMay 6, 2019
DLIMD: Dictionary Learning based Image-domain Material Decomposition for spectral CT

Weiwen Wu, Haijun Yu, Peijun Chen et al.

The potential huge advantage of spectral computed tomography (CT) is its capability to provide accuracy material identification and quantitative tissue information. This can benefit clinical applications, such as brain angiography, early tumor recognition, etc. To achieve more accurate material components with higher material image quality, we develop a dictionary learning based image-domain material decomposition (DLIMD) for spectral CT in this paper. First, we reconstruct spectral CT image from projections and calculate material coefficients matrix by selecting uniform regions of basis materials from image reconstruction results. Second, we employ the direct inversion (DI) method to obtain initial material decomposition results, and a set of image patches are extracted from the mode-1 unfolding of normalized material image tensor to train a united dictionary by the K-SVD technique. Third, the trained dictionary is employed to explore the similarities from decomposed material images by constructing the DLIMD model. Fourth, more constraints (i.e., volume conservation and the bounds of each pixel within material maps) are further integrated into the model to improve the accuracy of material decomposition. Finally, both physical phantom and preclinical experiments are employed to evaluate the performance of the proposed DLIMD in material decomposition accuracy, material image edge preservation and feature recovery.

NAApr 2, 2019
Efficient Second Order Unconditionally Stable Schemes for a Phase-field Moving Contact Line Model Using Invariant Energy Quadratization Approach

Xiaofeng Yang, Haijun Yu

We consider the numerical approximations for a phase field model consisting of incompressible Navier--Stokes equations with a generalized Navier boundary condition, and the Cahn-Hilliard equation with a dynamic moving contact line boundary condition. A crucial and challenging issue for solving this model numerically is the time marching problem, due to the high order, nonlinear, and coupled properties of the system. We solve this issue by developing two linear, second order accurate, and energy stable schemes based on the projection method for the Navier--Stokes equations, the invariant energy quadratization for the nonlinear gradient terms in the bulk and boundary, and a subtle implicit-explicit treatment for the stress and convective terms. The well-posedness of the semidiscretized system and the unconditional energy stabilities are proved. Various numerical results based on a spectral-Galerkin spatial discretization are presented to verify the accuracy and efficiency of the proposed schemes.

LGAug 4, 2018
Application of Bounded Total Variation Denoising in Urban Traffic Analysis

Shanshan Tang, Haijun Yu

While it is believed that denoising is not always necessary in many big data applications, we show in this paper that denoising is helpful in urban traffic analysis by applying the method of bounded total variation denoising to the urban road traffic prediction and clustering problem. We propose two easy-to-implement methods to estimate the noise strength parameter in the denoising algorithm, and apply the denoising algorithm to GPS-based traffic data from Beijing taxi system. For the traffic prediction problem, we combine neural network and history matching method for roads randomly chosen from an urban area of Beijing. Numerical experiments show that the predicting accuracy is improved significantly by applying the proposed bounded total variation denoising algorithm. We also test the algorithm on clustering problem, where a recently developed clustering analysis method is applied to more than one hundred urban road segments in Beijing based on their velocity profiles. Better clustering result is obtained after denoising.