LG MLOct 23, 2020

On the Equivalence of Decoupled Graph Convolution Network and Label Propagation

Hande Dong, Jiawei Chen, Fuli Feng, Xiangnan He, Shuxian Bi, Zhaolin Ding, Peng Cui

arXiv:2010.12408v221.9124 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses a fundamental gap in understanding for researchers in graph neural networks, offering insights into decoupled GCN mechanisms and improving semi-supervised node classification, though it is incremental as it builds on existing decoupled GCN paradigms.

The paper tackles the lack of understanding in decoupled Graph Convolution Networks (GCNs) by proving their equivalence to a two-step label propagation process, revealing that they assign structure- and model-aware weights to pseudo-labels, which explains their robustness to structure noise and over-smoothing but sensitivity to label noise and initialization. It proposes a new method, Propagation then Training Adaptively (PTA), which is shown to be more effective and robust than decoupled GCNs on four benchmark datasets.

The original design of Graph Convolution Network (GCN) couples feature transformation and neighborhood aggregation for node representation learning. Recently, some work shows that coupling is inferior to decoupling, which supports deep graph propagation better and has become the latest paradigm of GCN (e.g., APPNP and SGCN). Despite effectiveness, the working mechanisms of the decoupled GCN are not well understood. In this paper, we explore the decoupled GCN for semi-supervised node classification from a novel and fundamental perspective -- label propagation. We conduct thorough theoretical analyses, proving that the decoupled GCN is essentially the same as the two-step label propagation: first, propagating the known labels along the graph to generate pseudo-labels for the unlabeled nodes, and second, training normal neural network classifiers on the augmented pseudo-labeled data. More interestingly, we reveal the effectiveness of decoupled GCN: going beyond the conventional label propagation, it could automatically assign structure- and model- aware weights to the pseudo-label data. This explains why the decoupled GCN is relatively robust to the structure noise and over-smoothing, but sensitive to the label noise and model initialization. Based on this insight, we propose a new label propagation method named Propagation then Training Adaptively (PTA), which overcomes the flaws of the decoupled GCN with a dynamic and adaptive weighting strategy. Our PTA is simple yet more effective and robust than decoupled GCN. We empirically validate our findings on four benchmark datasets, demonstrating the advantages of our method. The code is available at https://github.com/DongHande/PT_propagation_then_training.

View on arXiv PDF Code

Similar