LGAIMay 25

MDGMIX: Boundary-Aware Subgraph Mixing for Multi-Domain Graph Pre-Training

arXiv:2605.2577153.8Has Code
AI Analysis

This work addresses the problem of high computational cost and data redundancy in multi-domain graph pre-training for few-shot classification tasks.

MDGMIX introduces a boundary-aware subgraph mixing method for multi-domain graph pre-training that reduces data redundancy and computational costs, achieving superior few-shot classification performance with better time and memory efficiency compared to strong baselines.

Multi-domain graph pre-training is a crucial step in constructing foundational graph models with cross-domain generalization capabilities. However, existing methods predominantly rely on jointly training all source domain graphs, resulting in high computational costs. Furthermore, it remains unclear whether all source domain graph data contribute equally to effective transfer. This paper empirically reveals significant data redundancy in multi-domain graph pre-training. Based on this finding, we propose the Multi-domain Graph Pre-training Framework, MDGMIX, which combines boundary-aware subgraph mixing with hierarchical discrimination. By selecting boundary nodes to construct challenging mixed-domain subgraphs, MDGMIX employs coarse-grained domain discrimination and fine-grained domain decomposition losses to decouple shared patterns from domain-specific patterns. During adaptation, MDGMIX employs a lightweight prompt weighting mechanism to transfer source domain knowledge. Extensive experiments demonstrate that MDGMIX consistently outperforms strong baselines in few-shot classification tasks while exhibiting superior time and memory efficiency. The code is available at: https://github.com/zhengziyu77/MDGMIX.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes