LG AIOct 15, 2022

Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative Representations

Haifeng Li, Jun Cao, Jiawei Zhu, Qinyao Luo, Silu He, Xuyin Wang

arXiv:2210.08345v218.168 citationsh-index: 27Has Code

Originality Incremental advance

AI Analysis

This work addresses a key bottleneck in graph representation learning by eliminating the need for data augmentation, which is incremental but offers practical improvements for researchers and practitioners in graph-based tasks.

The paper tackles the problem of graph contrastive learning requiring data augmentation and negative samples by proposing an augmentation-free method, iGCL, which uses an invariant-discriminative loss to learn representations; it outperforms baselines on 5 node classification datasets, showing superior performance across label ratios and robustness to graph attacks.

The pretasks are mainly built on mutual information estimation, which requires data augmentation to construct positive samples with similar semantics to learn invariant signals and negative samples with dissimilar semantics in order to empower representation discriminability. However, an appropriate data augmentation configuration depends heavily on lots of empirical trials such as choosing the compositions of data augmentation techniques and the corresponding hyperparameter settings. We propose an augmentation-free graph contrastive learning method, invariant-discriminative graph contrastive learning (iGCL), that does not intrinsically require negative samples. iGCL designs the invariant-discriminative loss (ID loss) to learn invariant and discriminative representations. On the one hand, ID loss learns invariant signals by directly minimizing the mean square error between the target samples and positive samples in the representation space. On the other hand, ID loss ensures that the representations are discriminative by an orthonormal constraint forcing the different dimensions of representations to be independent of each other. This prevents representations from collapsing to a point or subspace. Our theoretical analysis explains the effectiveness of ID loss from the perspectives of the redundancy reduction criterion, canonical correlation analysis, and information bottleneck principle. The experimental results demonstrate that iGCL outperforms all baselines on 5 node classification benchmark datasets. iGCL also shows superior performance for different label ratios and is capable of resisting graph attacks, which indicates that iGCL has excellent generalization and robustness. The source code is available at https://github.com/lehaifeng/T-GCN/tree/master/iGCL.

View on arXiv PDF Code

Similar