NAFeb 1, 2012
Weak Galerkin Methods for Second Order Elliptic Interface ProblemsLin Mu, Junping Wang, Guowei Wei et al.
Weak Galerkin methods refer to general finite element methods for PDEs in which differential operators are approximated by their weak forms as distributions. Such weak forms give rise to desirable flexibilities in enforcing boundary and interface conditions. A weak Galerkin finite element method (WG-FEM) is developed in this paper for solving elliptic partial differential equations (PDEs) with discontinuous coefficients and interfaces. The paper also presents many numerical tests for validating the WG-FEM for solving second order elliptic interface problems. For such interface problems, the solution possesses a certain singularity due to the nonsmoothness of the interface. A challenge in research is to design high order numerical methods that work well for problems with low regularity in the solution. The best known numerical scheme in the literature is of order one for the solution itself in $L_\infty$ norm. It is demonstrated that the WG-FEM of lowest order is capable of delivering numerical approximations that are of order 1.75 in the usual $L_\infty$ norm for $C^1$ or Lipschitz continuous interfaces associated with a $C^1$ or $H^2$ continuous solutions. Theoretically, it is proved that high order of numerical schemes can be designed by using the WG-FEM with polynomials of high order on each element.
NAMar 5, 2013
Weak Galerkin Finite Element Methods for the Biharmonic Equation on Polytopal MeshesLin Mu, Junping Wang, Xiu Ye
A new weak Galerkin (WG) finite element method is introduced and analyzed in this paper for the biharmonic equation in its primary form. This method is highly robust and flexible in the element construction by using discontinuous piecewise polynomials on general finite element partitions consisting of polygons or polyhedra of arbitrary shape. The resulting WG finite element formulation is symmetric, positive definite, and parameter-free. Optimal order error estimates in a discrete $H^2$ norm is established for the corresponding WG finite element solutions. Error estimates in the usual $L^2$ norm are also derived, yielding a sub-optimal order of convergence for the lowest order element and an optimal order of convergence for all high order of elements. Numerical results are presented to confirm the theory of convergence under suitable regularity assumptions.
NANov 2, 2011
A Computational Study of the Weak Galerkin Method for Second-Order Elliptic EquationsLin Mu, Junping Wang, Yanqiu Wang et al.
The weak Galerkin finite element method is a novel numerical method that was first proposed and analyzed by Wang and Ye for general second order elliptic problems on triangular meshes. The goal of this paper is to conduct a computational investigation for the weak Galerkin method for various model problems with more general finite element partitions. The numerical results confirm the theory established by Wang and Ye. The results also indicate that the weak Galerkin method is efficient, robust, and reliable in scientific computing.
NAAug 17, 2012
Weak Galerkin Finite Element Methods on Polytopal MeshesLin Mu, Junping Wang, Xiu Ye
This paper introduces a new weak Galerkin (WG) finite element method for second order elliptic equations on polytopal meshes. This method, called WG-FEM, is designed by using a discrete weak gradient operator applied to discontinuous piecewise polynomials on finite element partitions of arbitrary polytopes with certain shape regularity. The paper explains how the numerical schemes are designed and why they provide reliable numerical approximations for the underlying partial differential equations. In particular, optimal order error estimates are established for the corresponding WG-FEM approximations in both a discrete $H^1$ norm and the standard $L^2$ norm. Numerical results are presented to demonstrate the robustness, reliability, and accuracy of the WG-FEM. All the results are derived for finite element partitions with polytopes. Allowing the use of discontinuous approximating functions on arbitrary polytopal elements is a highly demanded feature for numerical algorithms in scientific computing.
NADec 4, 2012
A Weak Galerkin Mixed Finite Element Method for Biharmonic EquationsLin Mu, Junping Wang, Yanqiu Wang et al.
This article introduces and analyzes a weak Galerkin mixed finite element method for solving the biharmonic equation. The weak Galerkin method, first introduced by two of the authors (J. Wang and X. Ye) in an earlier publication for second order elliptic problems, is based on the concept of discrete weak gradients. The method allows the use of completely discrete finite element functions on partitions of arbitrary polygon or polyhedron. In this article, the idea of weak Galerkin method is applied to discretize the Ciarlet-Raviart mixed formulation for the biharmonic equation. In particular, an a priori error estimation is given for the corresponding finite element approximations. The error analysis essentially follows the framework of Babuska, Osborn, and Pitkaranta and uses specially designed mesh-dependent norms. The proof is technically tedious due to the discontinuous nature of the weak Galerkin finite element functions. Some computational results are presented to demonstrate the efficiency of the method.
NAFeb 29, 2016
A Discrete Divergence-Free Weak Galerkin Finite Element Method for the Stokes EquationsLin Mu, Junping Wang, Xiu Ye
A discrete divergence-free weak Galerkin finite element method is developed for the Stokes equations based on a weak Galerkin (WG) method introduced in the reference [15]. Discrete divergence-free bases are constructed explicitly for the lowest order weak Galerkin elements in two and three dimensional spaces. These basis functions can be derived on general meshes of arbitrary shape of polygons and polyhedrons. With the divergence-free basis derived, the discrete divergence-free WG scheme can eliminate the pressure variable from the system and reduces a saddle point problem to a symmetric and positive definite system with many fewer unknowns. Numerical results are presented to demonstrate the robustness and accuracy of this discrete divergence-free WG method.
NADec 2, 2012
A C^0-Weak Galerkin Finite Element Method for the Biharmonic EquationLin Mu, Junping Wang, Xiu Ye et al.
A C^0-weak Galerkin (WG) method is introduced and analyzed for solving the biharmonic equation in 2D and 3D. A weak Laplacian is defined for C^0 functions in the new weak formulation. This WG finite element formulation is symmetric, positive definite and parameter free. Optimal order error estimates are established in both a discrete H^2 norm and the L^2 norm, for the weak Galerkin finite element solution. Numerical results are presented to confirm the theory. As a technical tool, a refined Scott-Zhang interpolation operator is constructed to assist the corresponding error estimate. This refined interpolation preserves the volume mass of order (k+1-d) and the surface mass of order (k+2-d) for the P_{k+2} finite element functions in d-dimensional space.
NAOct 17, 2016
A simple finite element method for the Stokes equationsLin Mu, Xiu Ye
The goal of this paper is to introduce a simple finite element method to solve the Stokes and the Navier-Stokes equations. This method is in primal velocity-pressure formulation and is so simple such that both velocity and pressure are approximated by piecewise constant functions. Implementation issues as well as error analysis are investigated. A basis for a divergence free subspace of the velocity field is constructed so that the original saddle point problem can be reduced to a symmetric and positive definite system with much fewer unknowns. The numerical experiments indicate that the method is accurate and robust.
NAFeb 29, 2016
Effective Implementation of the Weak Galerkin Finite Element Methods for the Biharmonic EquationLin Mu, Junping Wang, Xiu Ye
The weak Galerkin (WG) methods have been introduced in the references [11, 16] for solving the biharmonic equation. The purpose of this paper is to develop an algorithm to implement the WG methods effectively. This can be achieved by eliminating local unknowns to obtain a global system with significant reduction of size. In fact, this reduced global system is equivalent to the Schur complements of the WG methods. The unknowns of the Schur complement of the WG method are those defined on the element boundaries. The equivalence of the WG method and its Schur complement is established. The numerical results demonstrate the effectiveness of this new implementation technique.
LGApr 7Code
TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language ModelsLin Mu, Haiyang Wang, Li Ni et al.
Low-Rank Adaptation (LoRA) enables parameter-efficient fine-tuning of Large Language Models (LLMs), and recent Mixture-of-Experts (MoE) extensions further enhance flexibility by dynamically combining multiple LoRA experts. However, existing MoE-augmented LoRA methods assume that experts operate independently, often leading to unstable routing, expert dominance. In this paper, we propose \textbf{TalkLoRA}, a communication-aware MoELoRA framework that relaxes this independence assumption by introducing expert-level communication prior to routing. TalkLoRA equips low-rank experts with a lightweight Talking Module that enables controlled information exchange across expert subspaces, producing a more robust global signal for routing. Theoretically, we show that expert communication smooths routing dynamics by mitigating perturbation amplification while strictly generalizing existing MoELoRA architectures. Empirically, TalkLoRA consistently outperforms vanilla LoRA and MoELoRA across diverse language understanding and generation tasks, achieving higher parameter efficiency and more balanced expert routing under comparable parameter budgets. These results highlight structured expert communication as a principled and effective enhancement for MoE-based parameter-efficient adaptation. Code is available at https://github.com/why0129/TalkLoRA.
IVMar 24
L-UNet: An LSTM Network for Remote Sensing Image Change DetectionShuting Sun, Lin Mu, Lizhe Wang et al.
Change detection of high-resolution remote sensing images is an important task in earth observation and was extensively investigated. Recently, deep learning has shown to be very successful in plenty of remote sensing tasks. The current deep learning-based change detection method is mainly based on conventional long short-term memory (Conv-LSTM), which does not have spatial characteristics. Since change detection is a process with both spatiality and temporality, it is necessary to propose an end-to-end spatiotemporal network. To achieve this, Conv-LSTM, an extension of the Conv-LSTM structure, is introduced. Since it shares similar spatial characteristics with the convolutional layer, L-UNet, which substitutes partial convolution layers of UNet-to-Conv-LSTM and Atrous L-UNet (AL-UNet), which further using Atrous structure to multiscale spatial information is proposed. Experiments on two data sets are conducted and the proposed methods show the advantages both in quantity and quality when compared with some other methods.
NAJan 31, 2018
An Immersed Weak Galerkin Method For Elliptic Interface ProblemsLin Mu, Xu Zhang
In this paper, we present an immersed weak Galerkin method for solving second-order elliptic interface problems. The proposed method does not require the meshes to be aligned with the interface. Consequently, uniform Cartesian meshes can be used for nontrivial interfacial geometry. We show the existence and uniqueness of the numerical algorithm, and prove the error estimates for the energy norm. Numerical results are reported to demonstrate the performance of the method.
NANov 2, 2011
A Numerical Study on the Weak Galerkin Method for the Helmholtz Equation with Large Wave NumbersLin Mu, Junping Wang, Xiu Ye et al.
Weak Galerkin (WG) refers to general finite element methods for partial differential equations in which differential operators are approximated by weak forms through the usual integration by parts. In particular, WG methods allow the use of discontinuous finite element functions in the algorithm design. One of such examples was recently introduced by Wang and Ye for solving second order elliptic problems. The goal of this paper is to apply the WG method of Wang and Ye to the Helmholtz equation with high wave numbers. Several test scenarios are designed for a numerical investigation on the accuracy, convergence, and robustness of the WG method in both inhomogeneous and homogeneous media over convex and non-convex domains. Our numerical experiments indicate that weak Galerkin is a finite element technique that is easy to implement, and provides very accurate and robust numerical solutions for the Helmholtz problem with high wave numbers.
LGApr 12Code
Polynomial Expansion Rank Adaptation: Enhancing Low-Rank Fine-Tuning with High-Order InteractionsWenhao Zhang, Lin Mu, Li Ni et al.
Low-rank adaptation (LoRA) is a widely used strategy for efficient fine-tuning of large language models (LLMs), but its strictly linear structure fundamentally limits expressive capacity. The bilinear formulation of weight updates captures only first-order dependencies between low-rank factors, restricting the modeling of nonlinear and higher-order parameter interactions. In this paper, we propose Polynomial Expansion Rank Adaptation (PERA), a novel method that introduces structured polynomial expansion directly into the low-rank factor space. By expanding each low-rank factor to synthesize high-order interaction terms before composition, PERA transforms the adaptation space into a polynomial manifold capable of modeling richer nonlinear coupling without increasing rank or inference cost. We provide theoretical analysis demonstrating that PERA offers enhanced expressive capacity and more effective feature utilization compare to existing linear adaptation approaches. Empirically, PERA consistently outperforms state-of-the-art methods across diverse benchmarks. Notably, our experiments show that incorporating high-order nonlinear components particularly square terms is crucial for enhancing expressive capacity and maintaining strong and robust performance under various rank settings. Our code is available at https://github.com/zhangwenhao6/PERA
NAFeb 12, 2018
A Domain-Decomposition Model Reduction Method for Linear Convection-Diffusion Equations with Random CoefficientsLin Mu, Guannan Zhang
We develop a domain-decomposition model reduction method for linear steady-state convection-diffusion equations with random coefficients. Of particular interest to this effort are the diffusion equations with random diffusivities, and the convection-dominated transport equations with random velocities. We investigate the equations with two types of random fields, i.e., colored noises and discrete white noises, both of which can lead to high-dimensional parametric dependence. The motivation is to use domain decomposition to exploit low-dimensional structures of local problems in the sub-domains, such that the total number of expensive PDE solves can be greatly reduced. Our objective is to develop an efficient model reduction method to simultaneously handle high-dimensionality and irregular behaviors of the stochastic PDEs under consideration. The advantages of our method lie in three aspects: (i) online-offline decomposition, i.e., the online cost is independent of the size of the triangle mesh; (ii) operator approximation for handling non-affine and high-dimensional random fields; (iii) effective strategy to capture irregular behaviors, e.g., sharp transitions of the PDE solution. Two numerical examples will be provided to demonstrate the advantageous performance of our method.
DSApr 27Code
Coloring for dispersion: A polynomial-time algorithm for cardinality-constrained 2-anticlusteringNguyen Khoa Tran, Lin Mu, Martin Papenberg et al.
The $k$-Maximum Dispersion Problem with Cardinality Constraints ($k$-MDCC) asks for a partition of a given item set with pairwise dissimilarities into $k$ cardinality-constrained groups such that the minimum pairwise intra-group dissimilarity, which is also known as the dispersion, is maximized. The problem arises in the context of anticlustering, where the goal is to create maximally heterogeneous groups of items with applications in psychological research, bioinformatics, and data science. It is known that $k$-MDCC is NP-hard for $k \geq 3$ but it has been an open question whether it can be solved in polynomial time for $k = 2$. We give a positive answer to this question by showing that $2$-MDCC can be solved by a quadratic number of cardinality-constrained 2-coloring problem instances ($2$-COLCC). We solve these instances by transforming them into a restricted class of subset sum instances. Although subset sum is NP-complete in general, for this restricted class the input values are bounded, ensuring that the pseudopolynomial dynamic programming algorithm runs in polynomial time. As a consequence, we obtain a polynomial-time algorithm for $2$-MDCC. We demonstrate that a publicly available open-source implementation of our new algorithm outperforms the previous integer linear programming solution by several orders of magnitude so that even large datasets ($n = 10{,}000$) can be processed in less than a second.
IRApr 7
From Clues to Generation: Language-Guided Conditional Diffusion for Cross-Domain RecommendationZiang Lu, Lei Sang, Lin Mu et al.
Cross-domain Recommendation (CDR) exploits multi-domain correlations to alleviate data sparsity. As a core task within this field, inter-domain recommendation focuses on predicting preferences for users who interact in a source domain but lack behavioral records in a target domain. Existing approaches predominantly rely on overlapping users as anchors for knowledge transfer. In real-world scenarios, overlapping users are often scarce, leaving the vast majority of users with only single-domain interactions. For these users, the absence of explicit alignment signals makes fine-grained preference transfer intrinsically difficult. To address this challenge, this paper proposes Language-Guided Conditional Diffusion for CDR (LGCD), a novel framework that integrates Large Language Models (LLMs) and diffusion models for inter-domain sequential recommendation. Specifically, we leverage LLM reasoning to bridge the domain gap by inferring potential target preferences for single-domain users and mapping them to real items, thereby constructing pseudo-overlapping data. We distinguish between real and pseudo-interaction pathways and introduce additional supervision constraints to mitigate the semantic noise brought by pseudo-interaction. Furthermore, we design a conditional diffusion architecture to precisely guide the generation of target user representations based on source-domain patterns. Extensive experiments demonstrate that LGCD significantly outperforms state-of-the-art methods in inter-domain recommendation tasks.
LGMar 10
From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph ClusteringLi Ni, Shuaikang Zeng, Lin Mu et al.
Contrastive learning has demonstrated strong performance in attributed hypergraph clustering. Typically, existing methods based on contrastive learning first learn node embeddings and then apply clustering algorithms, such as k-means, to these embeddings to obtain the clustering results.However, these methods lack direct clustering supervision, risking the inclusion of clustering-irrelevant information in the learned graph.To this end, we propose a Contrastive learning approach for Attributed Hypergraph Clustering (CAHC), an end-to-end method that simultaneously learns node embeddings and obtains clustering results. CAHC consists of two main steps: representation learning and cluster assignment learning. The former employs a novel contrastive learning approach that incorporates both node-level and hyperedge-level objectives to generate node embeddings.The latter joint embedding and clustering optimization to refine these embeddings by clustering-oriented guidance and obtains clustering results simultaneously.Extensive experimental results demonstrate that CAHC outperforms baselines on eight datasets.
CLMay 27, 2025Code
DenseLoRA: Dense Low-Rank Adaptation of Large Language ModelsLin Mu, Xiaoyu Wang, Li Ni et al.
Low-rank adaptation (LoRA) has been developed as an efficient approach for adapting large language models (LLMs) by fine-tuning two low-rank matrices, thereby reducing the number of trainable parameters. However, prior research indicates that many of the weights in these matrices are redundant, leading to inefficiencies in parameter utilization. To address this limitation, we introduce Dense Low-Rank Adaptation (DenseLoRA), a novel approach that enhances parameter efficiency while achieving superior performance compared to LoRA. DenseLoRA builds upon the concept of representation fine-tuning, incorporating a single Encoder-Decoder to refine and compress hidden representations across all adaptation layers before applying adaptation. Instead of relying on two redundant low-rank matrices as in LoRA, DenseLoRA adapts LLMs through a dense low-rank matrix, improving parameter utilization and adaptation efficiency. We evaluate DenseLoRA on various benchmarks, showing that it achieves 83.8% accuracy with only 0.01% of trainable parameters, compared to LoRA's 80.8% accuracy with 0.70% of trainable parameters on LLaMA3-8B. Additionally, we conduct extensive experiments to systematically assess the impact of DenseLoRA's components on overall model performance. Code is available at https://github.com/mulin-ahu/DenseLoRA.
LGMay 5, 2024
Tree-based Ensemble Learning for Out-of-distribution DetectionZhaiming Shen, Menglun Wang, Guang Cheng et al.
Being able to successfully determine whether the testing samples has similar distribution as the training samples is a fundamental question to address before we can safely deploy most of the machine learning models into practice. In this paper, we propose TOOD detection, a simple yet effective tree-based out-of-distribution (TOOD) detection mechanism to determine if a set of unseen samples will have similar distribution as of the training samples. The TOOD detection mechanism is based on computing pairwise hamming distance of testing samples' tree embeddings, which are obtained by fitting a tree-based ensemble model through in-distribution training samples. Our approach is interpretable and robust for its tree-based nature. Furthermore, our approach is efficient, flexible to various machine learning tasks, and can be easily generalized to unsupervised setting. Extensive experiments are conducted to show the proposed method outperforms other state-of-the-art out-of-distribution detection methods in distinguishing the in-distribution from out-of-distribution on various tabular, image, and text data.
CLAug 12, 2025
Weakly Supervised Fine-grained Span-Level Framework for Chinese Radiology Report Quality AssuranceKaiyu Wang, Lin Mu, Zhiyao Yang et al.
Quality Assurance (QA) for radiology reports refers to judging whether the junior reports (written by junior doctors) are qualified. The QA scores of one junior report are given by the senior doctor(s) after reviewing the image and junior report. This process requires intensive labor costs for senior doctors. Additionally, the QA scores may be inaccurate for reasons like diagnosis bias, the ability of senior doctors, and so on. To address this issue, we propose a Span-level Quality Assurance EvaluaTOR (Sqator) to mark QA scores automatically. Unlike the common document-level semantic comparison method, we try to analyze the semantic difference by exploring more fine-grained text spans. Specifically, Sqator measures QA scores by measuring the importance of revised spans between junior and senior reports, and outputs the final QA scores by merging all revised span scores. We evaluate Sqator using a collection of 12,013 radiology reports. Experimental results show that Sqator can achieve competitive QA scores. Moreover, the importance scores of revised spans can be also consistent with the judgments of senior doctors.
CLJun 4, 2025
Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting AttacksLin Mu, Guowei Chu, Li Ni et al.
Large Language Models (LLMs) have demonstrated remarkable performance across various tasks by effectively utilizing a prompting strategy. However, they are highly sensitive to input perturbations, such as typographical errors or slight character order errors, which can substantially degrade their performance. Despite advances in prompting techniques, developing a prompting strategy that explicitly mitigates the negative impact of such perturbations remains an open challenge. To bridge this gap, we propose Robustness of Prompting (RoP), a novel prompting strategy specifically designed to enhance the robustness of LLMs. RoP consists of two stages: Error Correction and Guidance. In the Error Correction stage, RoP applies diverse perturbation methods to generate adversarial examples, which are then used to construct prompts that automatically correct input errors. In the Guidance stage, RoP generates an optimal guidance prompting based on the corrected input, steering the model toward more robust and accurate inferences. Through comprehensive experiments spanning arithmetic, commonsense, and logical reasoning tasks, we demonstrate that RoP significantly improves LLMs' robustness against adversarial perturbations. Notably, it maintains model accuracy with only minimal degradation compared to clean input scenarios, thereby establishing RoP as a practical and effective approach for enhancing LLM robustness in real-world applications.
SIMay 18, 2025
Community Search in Time-dependent Road-social Attributed NetworksLi Ni, Hengkai Xu, Lin Mu et al.
Real-world networks often involve both keywords and locations, along with travel time variations between locations due to traffic conditions. However, most existing cohesive subgraph-based community search studies utilize a single attribute, either keywords or locations, to identify communities. They do not simultaneously consider both keywords and locations, which results in low semantic or spatial cohesiveness of the detected communities, and they fail to account for variations in travel time. Additionally, these studies traverse the entire network to build efficient indexes, but the detected community only involves nodes around the query node, leading to the traversal of nodes that are not relevant to the community. Therefore, we propose the problem of discovering semantic-spatial aware k-core, which refers to a k-core with high semantic and time-dependent spatial cohesiveness containing the query node. To address this problem, we propose an exact and a greedy algorithm, both of which gradually expand outward from the query node. They are local methods that only access the local part of the attributed network near the query node rather than the entire network. Moreover, we design a method to calculate the semantic similarity between two keywords using large language models. This method alleviates the disadvantages of keyword-matching methods used in existing community search studies, such as mismatches caused by differently expressed synonyms and the presence of irrelevant words. Experimental results show that the greedy algorithm outperforms baselines in terms of structural, semantic, and time-dependent spatial cohesiveness.
SIMay 18, 2025
Pre-trained Prompt-driven Semi-supervised Local Community DetectionLi Ni, Hengkai Xu, Lin Mu et al.
Semi-supervised local community detection aims to leverage known communities to detect the community containing a given node. Although existing semi-supervised local community detection studies yield promising results, they suffer from time-consuming issues, highlighting the need for more efficient algorithms. Therefore, we apply the "pre-train, prompt" paradigm to semi-supervised local community detection and propose the Pre-trained Prompt-driven Semi-supervised Local community detection method (PPSL). PPSL consists of three main components: node encoding, sample generation, and prompt-driven fine-tuning. Specifically, the node encoding component employs graph neural networks to learn the representations of nodes and communities. Based on representations of nodes and communities, the sample generation component selects known communities that are structurally similar to the local structure of the given node as training samples. Finally, the prompt-driven fine-tuning component leverages these training samples as prompts to guide the final community prediction. Experimental results on five real-world datasets demonstrate that PPSL outperforms baselines in both community quality and efficiency.
SIMay 8, 2025
Community and hyperedge inference in multiple hypergraphsLi Ni, Ziqi Deng, Lin Mu et al.
Hypergraphs, capable of representing high-order interactions via hyperedges, have become a powerful tool for modeling real-world biological and social systems. Inherent relationships within these real-world systems, such as the encoding relationship between genes and their protein products, drive the establishment of interconnections between multiple hypergraphs. Here, we demonstrate how to utilize those interconnections between multiple hypergraphs to synthesize integrated information from multiple higher-order systems, thereby enhancing understanding of underlying structures. We propose a model based on the stochastic block model, which integrates information from multiple hypergraphs to reveal latent high-order structures. Real-world hyperedges exhibit preferential attachment, where certain nodes dominate hyperedge formation. To characterize this phenomenon, our model introduces hyperedge internal degree to quantify nodes' contributions to hyperedge formation. This model is capable of mining communities, predicting missing hyperedges of arbitrary sizes within hypergraphs, and inferring inter-hypergraph edges between hypergraphs. We apply our model to high-order datasets to evaluate its performance. Experimental results demonstrate strong performance of our model in community detection, hyperedge prediction, and inter-hypergraph edge prediction tasks. Moreover, we show that our model enables analysis of multiple hypergraphs of different types and supports the analysis of a single hypergraph in the absence of inter-hypergraph edges. Our work provides a practical and flexible tool for analyzing multiple hypergraphs, greatly advancing the understanding of the organization in real-world high-order systems.