LG MLJun 30, 2020

SCE: Scalable Network Embedding from Sparsest Cut

Shengzhong Zhang, Zengfeng Huang, Haicang Zhou, Ziang Zhou

arXiv:2006.16499v410.125 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of scalable network embedding for graph analysis, offering a simpler and faster training approach, though it appears incremental in the context of contrastive learning methods.

The paper tackles unsupervised network embedding by proposing SCE, a method that uses only negative samples for training, inspired by the sparsest cut problem and employing a Laplacian smoothing trick with GCN-type encoders. The results show advantages in accuracy and scalability over strong baselines like GraphSAGE, G2G, and DGI on real-world datasets.

Large-scale network embedding is to learn a latent representation for each node in an unsupervised manner, which captures inherent properties and structural information of the underlying graph. In this field, many popular approaches are influenced by the skip-gram model from natural language processing. Most of them use a contrastive objective to train an encoder which forces the embeddings of similar pairs to be close and embeddings of negative samples to be far. A key of success to such contrastive learning methods is how to draw positive and negative samples. While negative samples that are generated by straightforward random sampling are often satisfying, methods for drawing positive examples remains a hot topic. In this paper, we propose SCE for unsupervised network embedding only using negative samples for training. Our method is based on a new contrastive objective inspired by the well-known sparsest cut problem. To solve the underlying optimization problem, we introduce a Laplacian smoothing trick, which uses graph convolutional operators as low-pass filters for smoothing node representations. The resulting model consists of a GCN-type structure as the encoder and a simple loss function. Notably, our model does not use positive samples but only negative samples for training, which not only makes the implementation and tuning much easier, but also reduces the training time significantly. Finally, extensive experimental studies on real world data sets are conducted. The results clearly demonstrate the advantages of our new model in both accuracy and scalability compared to strong baselines such as GraphSAGE, G2G and DGI.

View on arXiv PDF Code

Similar