LGFeb 14, 2024

Unifying Invariance and Spuriousity for Graph Out-of-Distribution via Probability of Necessity and Sufficiency

Xuexin Chen, Ruichu Cai, Kaitao Zheng, Zhifan Jiang, Zhengting Huang, Zhifeng Hao, Zijian Li

arXiv:2402.09165v16.43 citationsh-index: 20

Originality Incremental advance

AI Analysis

This addresses graph OOD generalization, a domain-specific problem with real-world applications, but appears incremental as it builds on existing environment augmentation methods.

The paper tackles the problem of graph out-of-distribution generalization, where models trained on biased data need to generalize to unseen test data, by proposing a unified framework called PNSIS that extracts invariant substructures using probability of necessity and sufficiency and leverages spurious subgraphs to enhance robustness. Experimental results show that PNSIS outperforms state-of-the-art techniques on several benchmarks.

Graph Out-of-Distribution (OOD), requiring that models trained on biased data generalize to the unseen test data, has a massive of real-world applications. One of the most mainstream methods is to extract the invariant subgraph by aligning the original and augmented data with the help of environment augmentation. However, these solutions might lead to the loss or redundancy of semantic subgraph and further result in suboptimal generalization. To address this challenge, we propose a unified framework to exploit the Probability of Necessity and Sufficiency to extract the Invariant Substructure (PNSIS). Beyond that, this framework further leverages the spurious subgraph to boost the generalization performance in an ensemble manner to enhance the robustness on the noise data. Specificially, we first consider the data generation process for graph data. Under mild conditions, we show that the invariant subgraph can be extracted by minimizing an upper bound, which is built on the theoretical advance of probability of necessity and sufficiency. To further bridge the theory and algorithm, we devise the PNSIS model, which involves an invariant subgraph extractor for invariant graph learning as well invariant and spurious subgraph classifiers for generalization enhancement. Experimental results demonstrate that our \textbf{PNSIS} model outperforms the state-of-the-art techniques on graph OOD on several benchmarks, highlighting the effectiveness in real-world scenarios.

View on arXiv PDF

Similar