SILGMLMay 25, 2018

Scalable and Robust Community Detection with Randomized Sketching

arXiv:1805.10927v411 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of scalable community detection in large graphs for applications like social network analysis, though it appears incremental as it builds on existing matrix-decomposition-based methods with new sampling techniques.

The paper tackles the problem of unsupervised clustering in large, partially observed graphs by proposing a scalable randomized framework that uses sketching and sampling techniques to reduce computational complexity and improve performance, achieving nearly dimension-free phase transitions in low inter-cluster connectivity regimes.

This article explores and analyzes the unsupervised clustering of large partially observed graphs. We propose a scalable and provable randomized framework for clustering graphs generated from the stochastic block model. The clustering is first applied to a sub-matrix of the graph's adjacency matrix associated with a reduced graph sketch constructed using random sampling. Then, the clusters of the full graph are inferred based on the clusters extracted from the sketch using a correlation-based retrieval step. Uniform random node sampling is shown to improve the computational complexity over clustering of the full graph when the cluster sizes are balanced. A new random degree-based node sampling algorithm is presented which significantly improves upon the performance of the clustering algorithm even when clusters are unbalanced. This framework improves the phase transitions for matrix-decomposition-based clustering with regard to computational complexity and minimum cluster size, which are shown to be nearly dimension-free in the low inter-cluster connectivity regime. A third sampling technique is shown to improve balance by randomly sampling nodes based on spatial distribution. We provide analysis and numerical results using a convex clustering algorithm based on matrix completion.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes