SISTAT-MECHDSLGOCSep 10, 2022

Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity

arXiv:2209.04562v514 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This provides a more reliable method for community detection in small networks, though it is incremental as it builds on modularity optimization with computational improvements.

The paper tackles the problem of community detection in networks by proposing the Bayan algorithm, which globally maximizes modularity or approximates it with guarantees, achieving higher accuracy and stability in retrieving planted partitions than 29 other methods across standard benchmarks.

Community detection is a classic network problem with extensive applications in various fields. Its most common method is using modularity maximization heuristics which rarely return an optimal partition or anything similar. Partitions with globally optimal modularity are difficult to compute, and therefore have been underexplored. Using structurally diverse networks, we compare 30 community detection methods including our proposed algorithm that offers optimality and approximation guarantees: the Bayan algorithm. Unlike existing methods, Bayan globally maximizes modularity or approximates it within a factor. Our results show the distinctive accuracy and stability of maximum-modularity partitions in retrieving planted partitions at rates higher than most alternatives for a wide range of parameter settings in two standard benchmarks. Compared to the partitions from 29 other algorithms, maximum-modularity partitions have the best medians for description length, coverage, performance, average conductance, and well clusteredness. These advantages come at the cost of additional computations which Bayan makes possible for small networks (networks that have up to 3000 edges in their largest connected component). Bayan is several times faster than using open-source and commercial solvers for modularity maximization, making it capable of finding optimal partitions for instances that cannot be optimized by any other existing method. Our results point to a few well performing algorithms, among which Bayan stands out as the most reliable method for small networks. A Python implementation of the Bayan algorithm (bayanpy) is publicly available through the package installer for Python.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes