LG AIJan 7, 2024

Few-Shot Causal Representation Learning for Out-of-Distribution Generalization on Heterogeneous Graphs

Pengfei Ding, Yan Wang, Guanfeng Liu, Nan Wang, Xiaofang Zhou

arXiv:2401.03597v310.411 citationsh-index: 8IEEE Trans Knowl Data Eng

Originality Highly original

AI Analysis

This addresses a novel problem of distribution shifts in heterogeneous graph learning, which is incremental as it builds on existing few-shot methods but introduces causal modeling for robustness.

The paper tackles the problem of out-of-distribution generalization in heterogeneous graph few-shot learning, where distribution shifts between source and target data degrade performance, and proposes a causal model (COHF) that achieves superior results on seven real-world datasets.

Heterogeneous graph few-shot learning (HGFL) has been developed to address the label sparsity issue in heterogeneous graphs (HGs), which consist of various types of nodes and edges. The core concept of HGFL is to extract knowledge from rich-labeled classes in a source HG, transfer this knowledge to a target HG to facilitate learning new classes with few-labeled training data, and finally make predictions on unlabeled testing data. Existing methods typically assume that the source HG, training data, and testing data all share the same distribution. However, in practice, distribution shifts among these three types of data are inevitable due to two reasons: (1) the limited availability of the source HG that matches the target HG distribution, and (2) the unpredictable data generation mechanism of the target HG. Such distribution shifts result in ineffective knowledge transfer and poor learning performance in existing methods, thereby leading to a novel problem of out-of-distribution (OOD) generalization in HGFL. To address this challenging problem, we propose a novel Causal OOD Heterogeneous graph Few-shot learning model, namely COHF. In COHF, we first characterize distribution shifts in HGs with a structural causal model, establishing an invariance principle for OOD generalization in HGFL. Then, following this invariance principle, we propose a new variational autoencoder-based heterogeneous graph neural network to mitigate the impact of distribution shifts. Finally, by integrating this network with a novel meta-learning framework, COHF effectively transfers knowledge to the target HG to predict new classes with few-labeled data. Extensive experiments on seven real-world datasets have demonstrated the superior performance of COHF over the state-of-the-art methods.

View on arXiv PDF

Similar