Spectral Clustering for Directed Graphs via Likelihood Estimation on Stochastic Block Models
This work addresses the underexplored challenge of clustering directed graphs, which is important for applications in network analysis, but it is incremental as it builds on existing stochastic block model frameworks.
The paper tackles the problem of spectral clustering for directed graphs by leveraging maximum likelihood estimation under a directed stochastic block model, resulting in a novel algorithm with a theoretical error bound and significant performance gains in experiments.
Graph clustering is a fundamental task in unsupervised learning with broad real-world applications. While spectral clustering methods for undirected graphs are well-established and guided by a minimum cut optimization consensus, their extension to directed graphs remains relatively underexplored due to the additional complexity introduced by edge directions. In this paper, we leverage statistical inference on stochastic block models to guide the development of a spectral clustering algorithm for directed graphs. Specifically, we study the maximum likelihood estimation under a widely used directed stochastic block model, and derive a global objective function that aligns with the underlying community structure. We further establish a theoretical upper bound on the misclustering error of its spectral relaxation, and based on this relaxation, introduce a novel, self-adaptive spectral clustering method for directed graphs. Extensive experiments on synthetic and real-world datasets demonstrate significant performance gains over existing baselines.