Deep Embedded Clustering with Distribution Consistency Preservation for Attributed Networks
This work addresses the problem of mining information in attributed networks for researchers and practitioners, but it is incremental as it builds on existing deep embedded clustering approaches by adding a consistency constraint.
The authors tackled clustering in attributed networks by proposing a deep embedded clustering model that enforces consistency between cluster distributions from network topology and node attributes, achieving significantly better or competitive performance compared to state-of-the-art methods on several datasets.
Many complex systems in the real world can be characterized by attributed networks. To mine the potential information in these networks, deep embedded clustering, which obtains node representations and clusters simultaneously, has been paid much attention in recent years. Under the assumption of consistency for data in different views, the cluster structure of network topology and that of node attributes should be consistent for an attributed network. However, many existing methods ignore this property, even though they separately encode node representations from network topology and node attributes meanwhile clustering nodes on representation vectors learnt from one of the views. Therefore, in this study, we propose an end-to-end deep embedded clustering model for attributed networks. It utilizes graph autoencoder and node attribute autoencoder to respectively learn node representations and cluster assignments. In addition, a distribution consistency constraint is introduced to maintain the latent consistency of cluster distributions of two views. Extensive experiments on several datasets demonstrate that the proposed model achieves significantly better or competitive performance compared with the state-of-the-art methods. The source code can be found at https://github.com/Zhengymm/DCP.