SI STAT-MECH SOC-PH MLMay 31, 2012

Oriented and Degree-generated Block Models: Generating and Inferring Communities with Inhomogeneous Degree Distributions

Yaojia Zhu, Xiaoran Yan, Cristopher Moore

arXiv:1205.7009v131 citations

Originality Incremental advance

AI Analysis

This work addresses the limitation of stochastic block models in handling real-world network inhomogeneities, offering improved community detection for applications like text analysis.

The paper tackled the problem of inferring community structure in networks with heavy-tailed degree distributions, where existing models fail to leverage vertex degrees and edge orientations. It introduced new block model variants that achieved higher accuracy on synthetic and word adjacency networks compared to standard and degree-corrected models.

The stochastic block model is a powerful tool for inferring community structure from network topology. However, it predicts a Poisson degree distribution within each community, while most real-world networks have a heavy-tailed degree distribution. The degree-corrected block model can accommodate arbitrary degree distributions within communities. But since it takes the vertex degrees as parameters rather than generating them, it cannot use them to help it classify the vertices, and its natural generalization to directed graphs cannot even use the orientations of the edges. In this paper, we present variants of the block model with the best of both worlds: they can use vertex degrees and edge orientations in the classification process, while tolerating heavy-tailed degree distributions within communities. We show that for some networks, including synthetic networks and networks of word adjacencies in English text, these new block models achieve a higher accuracy than either standard or degree-corrected block models.

View on arXiv PDF

Similar