LGAISIMay 2, 2024

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors

arXiv:2405.00957v28 citationsh-index: 5Has CodeNIPS
Originality Incremental advance
AI Analysis

This addresses a key problem in graph learning for researchers and practitioners, offering a plug-and-play solution that is incremental but generalizable.

The paper tackles the dual challenges of insufficient accurate labels and limited neighbors in Graph Neural Networks (GNNs) by proposing IntraMix, which generates high-quality labeled data and enriches neighborhoods, achieving improved performance across various GNNs and datasets.

Graph Neural Networks (GNNs) have shown great performance in various tasks, with the core idea of learning from data labels and aggregating messages within the neighborhood of nodes. However, the common challenges in graphs are twofold: insufficient accurate (high-quality) labels and limited neighbors for nodes, resulting in weak GNNs. Existing graph augmentation methods typically address only one of these challenges, often adding training costs or relying on oversimplified or knowledge-intensive strategies, limiting their generalization. To simultaneously address both challenges faced by graphs in a generalized way, we propose an elegant method called IntraMix. Considering the incompatibility of vanilla Mixup with the complex topology of graphs, IntraMix innovatively employs Mixup among inaccurate labeled data of the same class, generating high-quality labeled data at minimal cost. Additionally, it finds data with high confidence of being clustered into the same group as the generated data to serve as their neighbors, thereby enriching the neighborhoods of graphs. IntraMix efficiently tackles both issues faced by graphs and challenges the prior notion of the limited effectiveness of Mixup in node classification. IntraMix is a theoretically grounded plug-in-play method that can be readily applied to all GNNs. Extensive experiments demonstrate the effectiveness of IntraMix across various GNNs and datasets. Our code is available at: https://github.com/Zhengsh123/IntraMix.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes