NANov 11, 2015
Fast evaluation of the Caputo fractional derivative and its applications to fractional diffusion equationsShidong Jiang, Jiwei Zhang, Qian Zhang et al.
We present an efficient algorithm for the evaluation of the Caputo fractional derivative $_0^C\!D_t^αf(t)$ of order $α\in (0,1)$, which can be expressed as a convolution of $f'(t)$ with the kernel $t^{-α}$. The algorithm is based on an efficient sum-of-exponentials approximation for the kernel $t^{-1-α}$ on the interval $[Δt, T]$ with a uniform absolute error $\varepsilon$, where the number of exponentials $N_{\text{exp}}$ needed is of the order $O\left(\log\frac{1}{\varepsilon}\left( \log\log\frac{1}{\varepsilon}+\log\frac{T}{Δt}\right) +\log\frac{1}{Δt}\left( \log\log\frac{1}{\varepsilon}+\log\frac{1}{Δt}\right) \right)$. As compared with the direct method, the resulting algorithm reduces the storage requirement from $O(N_T)$ to $O(N_{\text{exp}})$ and the overall computational cost from $O(N_T^2)$ to $O(N_TN_{\text{exp}})$ with $N_T$ the total number of time steps. Furthermore, when the fast evaluation scheme of the Caputo derivative is applied to solve the fractional diffusion equations, the resulting algorithm requires only $O(N_SN_{\text{exp}})$ storage and $O(N_SN_TN_{\text{exp}})$ work with $N_S$ the total number of points in space; whereas the direct methods require $O(N_SN_T$) storage and $O(N_SN_T^2)$ work. The complexity of both algorithms is nearly optimal since $N_{\text{exp}}$ is of the order $O(\log N_T)$ for $T\gg 1$ or $O(\log^2N_T)$ for $T\approx 1$ for fixed accuracy $\varepsilon$. We also present a detailed stability and error analysis of the new scheme for solving linear fractional diffusion equations. The performance of the new algorithm is illustrated via several numerical examples. Finally, the algorithm can be parallelized in a straightforward manner.
NANov 20, 2018
Sharp $H^1$-norm error estimates of two time-stepping schemes for reaction-subdiffusion problemsJincheng Ren, Hong-lin Liao, Jiwei Zhang et al.
Due to the intrinsically initial singularity of solution and the discrete convolution form in numerical Caputo derivatives, the traditional $H^1$-norm analysis (corresponding to the case for a classical diffusion equation) to the time approximations of a fractional subdiffusion problem always leads to suboptimal error estimates (a loss of time accuracy). To recover the theoretical accuracy in time, we propose an improved discrete Grönwall inequality and apply it to the well-known L1 formula and a fractional Crank-Nicolson scheme. With the help of a time-space error-splitting technique and the global consistency analysis, sharp $H^1$-norm error estimates of the two nonuniform approaches are established for a reaction-subdiffusion problems. Numerical experiments are included to confirm the sharpness of our analysis.
NAFeb 15, 2017
Superconvergence of Immersed Finite Element Methods for Interface ProblemsWaixiang Cao, Xu Zhang, Zhimin Zhang
In this article, we study superconvergence properties of immersed finite element methods for the one dimensional elliptic interface problem. Due to low global regularity of the solution, classical superconvergence phenomenon for finite element methods disappears unless the discontinuity of the coefficient is resolved by partition. We show that immersed finite element solutions inherit all desired superconvergence properties from standard finite element methods without requiring the mesh to be aligned with the interface. In particular, on interface elements, superconvergence occurs at roots of generalized orthogonal polynomials that satisfy both orthogonality and interface jump conditions.
NAFeb 11, 2018
Ball Prolate Spheroidal Wave Functions In Arbitrary DimensionsJing Zhang, Huiyuan Li, Li-Lian Wang et al.
In this paper, we introduce the prolate spheroidal wave functions (PSWFs) of real order $α>-1$ on the unit ball in arbitrary dimension, termed as ball PSWFs. They are eigenfunctions of both a weighted concentration integral operator, and a Sturm-Liouville differential operator. Different from existing works on multi-dimensional PSWFs, the ball PSWFs are defined as a generalisation of orthogonal {\em ball polynomials} in primitive variables with a tuning parameter $c>0$, through a "perturbation" of the Sturm-Liouville equation of the ball polynomials. From this perspective, we can explore some interesting intrinsic connections between the ball PSWFs and the finite Fourier and Hankel transforms. We provide an efficient and accurate algorithm for computing the ball PSWFs and the associated eigenvalues, and present various numerical results to illustrate the efficiency of the method. Under this uniform framework, we can recover the existing PSWFs by suitable variable substitutions.
CVApr 10, 2022
Unsupervised Manga Character Re-identification via Face-body and Spatial-temporal Associated ClusteringZhimin Zhang, Zheng Wang, Wei Hu
In the past few years, there has been a dramatic growth in e-manga (electronic Japanese-style comics). Faced with the booming demand for manga research and the large amount of unlabeled manga data, we raised a new task, called unsupervised manga character re-identification. However, the artistic expression and stylistic limitations of manga pose many challenges to the re-identification problem. Inspired by the idea that some content-related features may help clustering, we propose a Face-body and Spatial-temporal Associated Clustering method (FSAC). In the face-body combination module, a face-body graph is constructed to solve problems such as exaggeration and deformation in artistic creation by using the integrity of the image. In the spatial-temporal relationship correction module, we analyze the appearance features of characters and design a temporal-spatial-related triplet loss to fine-tune the clustering. Extensive experiments on a manga book dataset with 109 volumes validate the superiority of our method in unsupervised manga character re-identification.
NAJun 21, 2016
Efficient Spectral and Spectral Element Methods for Eigenvalue Problems of Schrödinger Equations with an Inverse Square PotentialHuiyuan Li, Zhimin Zhang
In this article, we study numerical approximation of eigenvalue problems of the Schrödinger operator $\displaystyle -Δu + \frac{c^2}{|x|^2}u$. There are three stages in our investigation: We start from a ball of any dimension, in which case the exact solution in the radial direction can be expressed by Bessel functions of fractional degrees. This knowledge helps us to design two novel spectral methods by modifying the polynomial basis to fit the singularities of the eigenfunctions. At the second stage, we move to circular sectors in the two dimensional setting. Again the radial direction can be expressed by Bessel functions of fractional degrees. Only in the tangential direction some modifications are needed from stage one. At the final stage, we extend the idea to arbitrary polygonal domains. We propose a mortar spectral element approach: a polygonal domain is decomposed into several sub-domains with each singular corner including the origin covered by a circular sector, in which origin and corner singularities are handled similarly as in the former stages, and the remaining domains are either a standard quadrilateral/triangle or a quadrilateral/triangle with a circular edge, in which the traditional polynomial based spectral method is applied. All sub-domains are linked by mortar elements (note that we may have hanging nodes). In all three stages, exponential convergence rates are achieved. Numerical experiments indicate that our new methods are superior to standard polynomial based spectral (or spectral element) methods and $hp$-adaptive methods. Our study offers a new and effective way to handle eigenvalue problems of the Schrödinger operator including the Laplacian operator on polygonal domains with reentrant corners.
NAApr 11, 2017
An efficient spectral-Galerkin approximation and error analysis for Maxwell transmission eigenvalue problems in spherical geometriesJing An, zhimin Zhang
We propose and analyze an efficient spectral-Galerkin approximation for the Maxwell transmission eigenvalue problem in spherical geometry. Using a vector spherical harmonic expansion, we reduce the problem to a sequence of equivalent one-dimensional TE and TM modes that can be solved individually in parallel. For the TE mode, we derive associated generalized eigenvalue problems and corresponding pole conditions. Then we introduce weighted Sobolev spaces based on the pole condition and prove error estimates for the generalized eigenvalue problem. The TM mode is a coupled system with four unknown functions, which is challenging for numerical calculation. To handle it, we design an effective algorithm using Legendre-type vector basis functions. Finally, we provide some numerical experiments to validate our theoretical results and demonstrate the efficiency of the algorithms.
NADec 31, 2016
Supercloseness analysis and polynomial preserving recovery for a class of weak Galerkin methodRuishu Wang, Ran Zhang, Xu Zhang et al.
In this paper, we analyze convergence and supercloseness properties of a class of weak Galerkin (WG) finite element methods for solving second-order elliptic problems. It is shown that the WG solution is superclose to the Lagrange type interpolation using Lobatto points. This supercloseness behavior is obtained through some newly designed stabilization terms. A post-processing technique using the polynomial preserving recovery (PPR) is introduced for WG approximation. Superconvergence analysis is carried out for the PPR approximation. Numerical examples are provided to verify our theoretical results.
NASep 29, 2017
Superconvergence Points For The Spectral Interpolation Of Riesz Fractional DerivativesBeichuan Deng, Zhimin Zhang, Xuan Zhao
In this paper, superconvergence points are located for the approximation of the Riesz derivative of order $α$ using classical Lobatto-type polynomials when $α\in (0,1)$ and generalized Jacobi functions (GJF) for arbitrary $α> 0$, respectively. For the former, superconvergence points are zeros of the Riesz fractional derivative of the leading term in the truncated Legendre-Lobatto expansion. It is observed that the convergence rate for different $α$ at the superconvergence points is at least $O(N^{-2})$ better than the optimal global convergence rate. Furthermore, the interpolation is generalized to the Riesz derivative of order $α> 1$ with the help of GJF, which deal well with the singularities. The well-posedness, convergence and superconvergence properties are theoretically analyzed. The gain of the convergence rate at the superconvergence points is analyzed to be $O(N^{-(α+3)/2})$ for $α\in (0,1)$ and $O(N^{-2})$ for $α> 1$. Finally, we apply our findings in solving model FDEs and observe that the convergence rates are indeed much better at the predicted superconvergence points.
NAApr 14, 2018
A $C^0$ linear finite element method for sixth order elliptic equationsHailong Guo, Zhimin Zhang, Qingsong Zou
In this paper, we develop a straightforward $C^0$ linear finite element method for sixth-order elliptic equations. The basic idea is to use gradient recovery techniques to generate higher-order numerical derivatives from a $C^0$ linear finite element function. Both theoretical analysis and numerical experiments show that the proposed method has the optimal convergence rate under the energy norm. The method avoids complicated construction of conforming $C^2$ finite element basis or nonconforming penalty terms and has a low computational cost.
NAApr 26, 2012
Superconvergence Points of Spectral InterpolationZhimin Zhang
In this work, we study superconvergence properties for some high-order orthogonal polynomial interpolations.The results are two-folds: When interpolating function values, we identify those points where the first and second derivatives of the interpolant converge faster;When interpolating the first derivative,we locate those points where the function value of the interpolant superconverges. For the earlier case, we use various Chebyshev polynomials; and for the later case,we also include the counterpart Legendre polynomials.
NAJul 3, 2012
Any order superconvergence finite volume schemes for 1D general elliptic equationsWaixiang Cao, Zhimin Zhang, Qingsong Zou
We present and analyze a finite volume scheme of arbitrary order for elliptic equations in the one-dimensional setting. In this scheme, the control volumes are constructed by using the Gauss points in subintervals of the underlying mesh. We provide a unified proof for the inf-sup condition, and show that our finite volume scheme has optimal convergence rate under the energy and $L^2$ norms of the approximate error. Furthermore, we prove that the derivative error is superconvergent at all Gauss points and in some special case, the convergence rate can reach $h^{2r}$, where $r$ is the polynomial degree of the trial space. All theoretical results are justified by numerical tests.
NAMar 1, 2017
Superconvergence analysis of linear FEM based on the polynomial preserving recovery and Richardson extrapolation for Helmholtz equation with high wave numberYu Du, Haijun Wu, Zhimin Zhang
We study superconvergence property of the linear finite element method with the polynomial preserving recovery (PPR) and Richardson extrapolation for the two dimensional Helmholtz equation. The $H^1$-error estimate with explicit dependence on the wave number $k$ {is} derived. First, we prove that under the assumption $k(kh)^2\leq C_0$ ($h$ is the mesh size) and certain mesh condition, the estimate between the finite element solution and the linear interpolation of the exact solution is superconvergent under the $H^1$-seminorm, although the pollution error still exists. Second, we prove a similar result for the recovered gradient by PPR and found that the PPR can only improve the interpolation error and has no effect on the pollution error. Furthermore, we estimate the error between the finite element gradient and recovered gradient and discovered that the pollution error is canceled between these two quantities. Finally, we apply the Richardson extrapolation to recovered gradient and demonstrate numerically that PPR combined with the Richardson extrapolation can reduce the interpolation and pollution errors simultaneously, and therefore, leads to an asymptotically exact {\it a posteriori} error estimator. All theoretical findings are verified by numerical tests.
NAMay 9, 2012
Flux-conserving finite element methodsShangyou Zhang, Zhimin Zhang, Qingsong Zou
We analyze the flux conservation property of the finite element method. It is shown that the finite element solution does approximate the flux locally in the optimal order, i.e., the same order as that of the nodal interpolation operator. We propose two methods, post-processing the finite element solutions locally. The new solutions, remaining as optimal-order solutions, are flux-conserving elementwise. In one of our methods, the processed solution also satisfies the original finite element equations. While the high-order finite volume schemes are still under construction, our methods produce finite-volume-like finite element solution of any order. In particular, our methods avoid solving non-symmetric finite volume equations. Numerical tests in 2D and 3D verify our findings.
NAMay 8, 2018
An $H^2$(curl)-conforming finite element in 2D and its applications to the quad-curl problemQian Zhang, Lixiu Wang, Zhimin Zhang
In this paper, we first construct the $H^2$(curl)-conforming finite elements both on a rectangle and a triangle. They possess some fascinating properties which have been proven by a rigorous theoretical analysis. Then we apply the elements to construct a finite element space for discretizing quad-curl problems. Convergence orders $O(h^k)$ in the $H$(curl) norm and $O(h^{k-1})$ in the $H^2$(curl) norm are established. Numerical experiments are provided to confirm our theoretical results.
NAMar 21, 2018
Superconvergence Points of Integer and Fractional Derivatives of Special Hermite Interpolations and Its Applications in Solving FDEsBeichuan Deng, Jiwei Zhang, Zhimin Zhang
In this paper, we study convergence and superconvergence theory of integer and fractional derivatives of the one-point and the two-point Hermite interpolations. When considering the integer-order derivative, exponential decay of the error is proved, and superconvergence points are located, at which the convergence rates are $O(N^{-2})$ and $O(N^{-1.5})$, respectively, better than the global rate for the one-point and two-point interpolations. Here $N$ represents the degree of interpolation polynomial. It is proved that the $α$-th fractional derivative of $(u-u_N)$ with $k<α<k+1$, is bounded by its $(k+1)$-th derivative. Furthermore, the corresponding superconvergence points are predicted for fractional derivatives, and an eigenvalue method is proposed to calculate the superconvergence points for the Riemann-Liouville fractional derivative. In the application of the knowledge of superconvergence points to solve FDEs, we discover that a modified collocation method makes numerical solutions much more accurate than the traditional collocation method.
NAFeb 8, 2017
Superconvergence analysis of partially penalized immersed finite element methodHailong Guo, Xu Yang, Zhimin Zhang
The contribution of this paper contains two parts: first, we prove a supercloseness result for the partially penalized immersed finite element (PPIFE) method in [T. Lin, Y. Lin, and X. Zhang, SIAM J. Numer. Anal., 53 (2015), 1121--1144]; then based on the supercloseness result, we show that the gradient recovery method proposed in our previous work [H. Guo and X. Yang, arXiv: 1608.00063] can be applied to the PPIFE method and the recovered gradient converges to the exact gradient with a superconvergent rate $\mathcal{O}(h^{3/2})$. Hence, the gradient recovery method provides an asymptotically exact a posteriori error estimator for the PPIFE method. Several numerical examples are presented to verify our theoretical result.
NAJul 3, 2012
Finite volume schemes of any order on rectangular meshesZhimin Zhang, Qingsong Zou
In this paper, we analyze vertex-centered finite volume method (FVM) of any order for elliptic equations on rectangular meshes. The novelty is a unified proof of the inf-sup condition, based on which, we show that the FVM approximation converges to the exact solution with the optimal rate in the energy norm. Furthermore, we discuss superconvergence property of the FVM solution. With the help of this superconvergence result, we find that the FVM solution also converges to the exact solution with the optimal rate in the $L^2$-norm. Finally, we validate our theory with several numerical experiments.
NADec 11, 2016
Superconvergence analysis of DG-FEM based on the polynomial preserving recovery for Helmholtz equation with high wave numberYu Du, Zhimin Zhang
We study superconvergence property of the linear discontinuous Galerkin finite element method with the polynomial preserving recovery (PPR) and Richardson extrapolation for the two dimensional Helmholtz equation. The error estimate with explicit dependence on the wave number $k$, the penalty parameter $μ$ and the mesh condition parameter $α$ is derived. First, we prove that under the assumption $k(kh)^2\leq C_0$ ($h$ is the mesh size) and certain mesh condition, the estimate between the finite element solution and the linear interpolation of the exact solution is superconvergent under the $\norme{\cdot}$-seminorm. Second, we prove a superconvergence result for the recovered gradient by PPR. Furthermore, we estimate the error between the finite element gradient and recovered gradient, which motivate us to define the a posteriori error estimator. Finally, Some numerical examples are provided to confirm the theoretical results of superconvergence analysis. All theoretical findings are verified by numerical tests.
NAOct 27, 2016
Spectral-Galerkin Approximation and Optimal Error Estimate for Stokes Eigenvalue Problems in Polar GeometriesJing An, Huiyuan Li, Zhimin Zhang
In this paper we propose and analyze spectral-Galerkin methods for the Stokes eigenvalue problem based on the stream function formulation in polar geometries. We first analyze the stream function} formulated fourth-order equation under the polar coordinates, then we derive the pole condition and reduce the problem on a circular disk to a sequence of equivalent one-dimensional eigenvalue problems that can be solved in parallel. The novelty of our approach lies in the construction} of suitably weighted Sobolev spaces according to the pole conditions, based on which, the optimal error estimate for approximated eigenvalue of each one dimensional problem can be obtained. Further, we extend our method to the non-separable Stokes eigenvalue problem in an elliptic domain and establish the optimal error bounds. Finally, we provide some numerical experiments to validate our theoretical results and algorithms.
IRJul 1, 2024Code
Heterogeneous Graph-based Framework with Disentangled Representations Learning for Multi-target Cross Domain RecommendationXiaopeng Liu, Juan Zhang, Chongqi Ren et al.
CDR (Cross-Domain Recommendation), i.e., leveraging information from multiple domains, is a critical solution to data sparsity problem in recommendation system. The majority of previous research either focused on single-target CDR (STCDR) by utilizing data from the source domains to improve the model's performance on the target domain, or applied dual-target CDR (DTCDR) by integrating data from the source and target domains. In addition, multi-target CDR (MTCDR) is a generalization of DTCDR, which is able to capture the link among different domains. In this paper we present HGDR (Heterogeneous Graph-based Framework with Disentangled Representations Learning), an end-to-end heterogeneous network architecture where graph convolutional layers are applied to model relations among different domains, meanwhile utilizes the idea of disentangling representation for domain-shared and domain-specifc information. First, a shared heterogeneous graph is generated by gathering users and items from several domains without any further side information. Second, we use HGDR to compute disentangled representations for users and items in all domains. Experiments on real-world datasets and online A/B tests prove that our proposed model can transmit information among domains effectively and reach the SOTA performance. The code can be found here: https://github.com/NetEase-Media/HGCDR.
CVMar 15, 2024
RangeLDM: Fast Realistic LiDAR Point Cloud GenerationQianjiang Hu, Zhimin Zhang, Wei Hu
Autonomous driving demands high-quality LiDAR data, yet the cost of physical LiDAR sensors presents a significant scaling-up challenge. While recent efforts have explored deep generative models to address this issue, they often consume substantial computational resources with slow generation speeds while suffering from a lack of realism. To address these limitations, we introduce RangeLDM, a novel approach for rapidly generating high-quality range-view LiDAR point clouds via latent diffusion models. We achieve this by correcting range-view data distribution for accurate projection from point clouds to range images via Hough voting, which has a critical impact on generative learning. We then compress the range images into a latent space with a variational autoencoder, and leverage a diffusion model to enhance expressivity. Additionally, we instruct the model to preserve 3D structural fidelity by devising a range-guided discriminator. Experimental results on KITTI-360 and nuScenes datasets demonstrate both the robust expressiveness and fast speed of our LiDAR point cloud generation.
NAJun 8, 2022
$\bf H(\mathrm{curl}^2)$ conforming element for Maxwell's transmission eigenvalue problem using fixed-point approachJiayu Han, Zhimin Zhang
Using newly developed ${\bf H}(\mathrm{curl}^2)$ conforming elements, we solve the Maxwell's transmission eigenvalue problem. Both real and complex eigenvalues are considered. Based on the fixed-point weak formulation with reasonable assumptions, the optimal error estimates for numerical eigenvalues and eigenfunctions (in the ${\bf H}(\mathrm{curl}^2)$-norm and ${\bf H}(\mathrm{curl})$-semi-norm) are established.
CVJan 8, 2024
InvariantOODG: Learning Invariant Features of Point Clouds for Out-of-Distribution GeneralizationZhimin Zhang, Xiang Gao, Wei Hu
The convenience of 3D sensors has led to an increase in the use of 3D point clouds in various applications. However, the differences in acquisition devices or scenarios lead to divergence in the data distribution of point clouds, which requires good generalization of point cloud representation learning methods. While most previous methods rely on domain adaptation, which involves fine-tuning pre-trained models on target domain data, this may not always be feasible in real-world scenarios where target domain data may be unavailable. To address this issue, we propose InvariantOODG, which learns invariability between point clouds with different distributions using a two-branch network to extract local-to-global features from original and augmented point clouds. Specifically, to enhance local feature learning of point clouds, we define a set of learnable anchor points that locate the most useful local regions and two types of transformations to augment the input point clouds. The experimental results demonstrate the effectiveness of the proposed model on 3D domain generalization benchmarks.
CVJul 10, 2025
Behave Your Motion: Habit-preserved Cross-category Animal Motion TransferZhimin Zhang, Bi'an Du, Caoyuan Ma et al.
Animal motion embodies species-specific behavioral habits, making the transfer of motion across categories a critical yet complex task for applications in animation and virtual reality. Existing motion transfer methods, primarily focused on human motion, emphasize skeletal alignment (motion retargeting) or stylistic consistency (motion style transfer), often neglecting the preservation of distinct habitual behaviors in animals. To bridge this gap, we propose a novel habit-preserved motion transfer framework for cross-category animal motion. Built upon a generative framework, our model introduces a habit-preservation module with category-specific habit encoder, allowing it to learn motion priors that capture distinctive habitual characteristics. Furthermore, we integrate a large language model (LLM) to facilitate the motion transfer to previously unobserved species. To evaluate the effectiveness of our approach, we introduce the DeformingThings4D-skl dataset, a quadruped dataset with skeletal bindings, and conduct extensive experiments and quantitative analyses, which validate the superiority of our proposed model.
CVJun 21, 2024
Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video RetrievalWenjun Li, Shudong Wang, Dong Zhao et al.
The key of the text-to-video retrieval (TVR) task lies in learning the unique similarity between each pair of text (consisting of words) and video (consisting of audio and image frames) representations. However, some problems exist in the representation alignment of video and text, such as a text, and further each word, are of different importance for video frames. Besides, audio usually carries additional or critical information for TVR in the case that frames carry little valid information. Therefore, in TVR task, multi-granularity representation of text, including whole sentence and every word, and the modal of audio are salutary which are underutilized in most existing works. To address this, we propose a novel multi-granularity feature interaction module called MGFI, consisting of text-frame and word-frame, for video-text representations alignment. Moreover, we introduce a cross-modal feature interaction module of audio and text called CMFI to solve the problem of insufficient expression of frames in the video. Experiments on benchmark datasets such as MSR-VTT, MSVD, DiDeMo show that the proposed method outperforms the existing state-of-the-art methods.
CLSep 27, 2021
Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal informationFengji Zhang, Xiao Yu, Jacky Keung et al.
Context: Stack Overflow is very helpful for software developers who are seeking answers to programming problems. Previous studies have shown that a growing number of questions are of low quality and thus obtain less attention from potential answerers. Gao et al. proposed an LSTM-based model (i.e., BiLSTM-CC) to automatically generate question titles from the code snippets to improve the question quality. However, only using the code snippets in the question body cannot provide sufficient information for title generation, and LSTMs cannot capture the long-range dependencies between tokens. Objective: This paper proposes CCBERT, a deep learning based novel model to enhance the performance of question title generation by making full use of the bi-modal information of the entire question body. Method: CCBERT follows the encoder-decoder paradigm and uses CodeBERT to encode the question body into hidden representations, a stacked Transformer decoder to generate predicted tokens, and an additional copy attention layer to refine the output distribution. Both the encoder and decoder perform the multi-head self-attention operation to better capture the long-range dependencies. This paper builds a dataset containing around 200,000 high-quality questions filtered from the data officially published by Stack Overflow to verify the effectiveness of the CCBERT model. Results: CCBERT outperforms all the baseline models on the dataset. Experiments on both code-only and low-resource datasets show the superiority of CCBERT with less performance degradation. The human evaluation also shows the excellent performance of CCBERT concerning both readability and correlation criteria.
CLSep 17, 2021
Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text ClassificationWei Huang, Chen Liu, Yihua Zhao et al.
Hierarchical Text Classification (HTC), which aims to predict text labels organized in hierarchical space, is a significant task lacking in investigation in natural language processing. Existing methods usually encode the entire hierarchical structure and fail to construct a robust label-dependent model, making it hard to make accurate predictions on sparse lower-level labels and achieving low Macro-F1. In this paper, we propose a novel PAMM-HiA-T5 model for HTC: a hierarchy-aware T5 model with path-adaptive mask mechanism that not only builds the knowledge of upper-level labels into low-level ones but also introduces path dependency information in label prediction. Specifically, we generate a multi-level sequential label structure to exploit hierarchical dependency across different levels with Breadth-First Search (BFS) and T5 model. To further improve label dependency prediction within each path, we then propose an original path-adaptive mask mechanism (PAMM) to identify the label's path information, eliminating sources of noises from other paths. Comprehensive experiments on three benchmark datasets show that our novel PAMM-HiA-T5 model greatly outperforms all state-of-the-art HTC approaches especially in Macro-F1. The ablation studies show that the improvements mainly come from our innovative approach instead of T5.
CVDec 31, 2018
Unsupervised monocular stereo matchingZhimin Zhang, Jianzhong Qiao, Shukuan Lin
At present, deep learning has been applied more and more in monocular image depth estimation and has shown promising results. The current more ideal method for monocular depth estimation is the supervised learning based on ground truth depth, but this method requires an abundance of expensive ground truth depth as the supervised labels. Therefore, researchers began to work on unsupervised depth estimation methods. Although the accuracy of unsupervised depth estimation method is still lower than that of supervised method, it is a promising research direction. In this paper, Based on the experimental results that the stereo matching models outperforms monocular depth estimation models under the same unsupervised depth estimation model, we proposed an unsupervised monocular vision stereo matching method. In order to achieve the monocular stereo matching, we constructed two unsupervised deep convolution network models, one was to reconstruct the right view from the left view, and the other was to estimate the depth map using the reconstructed right view and the original left view. The two network models are piped together during the test phase. The output results of this method outperforms the current mainstream unsupervised depth estimation method in the challenging KITTI dataset.
NAAug 28, 2017
Acceleration of weak Galerkin methods for the Laplacian eigenvalue problemQilong Zhai, Hehu Xie, Ran Zhang et al.
Recently, we proposed a weak Galerkin finite element method for the Laplace eigenvalue problem. In this paper, we present two-grid and two-space skills to accelerate the weak Galerkin method. By choosing parameters properly, the two-grid and two-space weak Galerkin method not only doubles the convergence rate, but also maintains the asymptotic lower bounds property of the weak Galerkin method. Some numerical examples are provided to validate our theoretical analysis.
NAAug 3, 2017
Superconvergence of immersed finite volume methods for one-dimensional interface problemsWaixiang Cao, Xu Zhang, Zhimin Zhang et al.
In this paper, we introduce a class of high order immersed finite volume methods (IFVM) for one-dimensional interface problems. We show the optimal convergence of IFVM in H1 and L2 norms. We also prove some superconvergence results of IFVM. To be more precise, the IFVM solution is superconvergent of order p+2 at the roots of generalized Lobatto polynomials, and the flux is superconvergent of order p+1 at generalized Gauss points on each element including the interface element. Furthermore, for diffusion interface problems, the convergence rates for IFVM solution at the mesh points and the flux at generalized Gaussian points can both be raised to 2p. These superconvergence results are consistent with those for the standard finite volume methods. Numerical examples are provided to confirm our theoretical analysis.
NASep 24, 2016
A Multilevel Correction Scheme for Nonsymmetric Eigenvalue Problems by Finite Element MethodsHehu Xie, Zhimin Zhang
A multilevel correction scheme is proposed to solve defective and nodefective of nonsymmetric partial differential operators by the finite element method. The method includes multi correction steps in a sequence of finite element spaces. In each correction step, we only need to solve two source problems on a finer finite element space and two eigenvalue problems on the coarsest finite element space. The accuracy of the eigenpair approximation is improved after each correction step. This correction scheme improves overall efficiency of the finite element method in solving nonsymmetric eigenvalue problems.
NASep 15, 2016
Robust residual-based a posteriori error estimators for mixed finite element methods for fourth order elliptic singularly perturbed problemsShaohong Du, Runchang Lin, Zhimin Zhang
We consider mixed finite element approximation of a singularly perturbed fourth-order elliptic problem with two different boundary conditions, and present a new measure of the error, whose components are balanced with respect to the perturbation parameter. Robust residual-based a posteriori estimators for the new measure are obtained, which are achieved via a novel analytical technique based on an approximation result. Numerical examples are presented to validate our theory.
NASep 14, 2016
Robust recovery-type a posteriori error estimators for streamline upwind/Petrov Galerkin discretizations for singularly perturbed problemsShaohong Du, Runchang Lin, Zhimin Zhang
In this paper, we investigate adaptive streamline upwind/Petrov Galerkin (SUPG) methods for singularly perturbed convection-diffusion-reaction equations in a new dual norm presented in [Du and Zhang, J. Sci. Comput. (2015)]. The flux is recovered by either local averaging in conforming $H({\rm div})$ spaces or weighted global $L^2$ projection onto conforming $H({\rm div})$ spaces. We further introduce a recovery stabilization procedure, and develop completely robust a posteriori error estimators with respect to the singular perturbation parameter $\varepsilon$. Numerical experiments are reported to support the theoretical results and to show that the estimated errors depend on the degrees of freedom uniformly in $\varepsilon$.