MLLGJun 15, 2014

An Incremental Reseeding Strategy for Clustering

arXiv:1406.3837v11 citations
Originality Incremental advance
AI Analysis

This work addresses graph partitioning for clustering tasks, offering a faster and accurate solution, but it appears incremental as it builds on existing coarsen-cluster-refine approaches.

The paper tackles the problem of multiway graph partitioning by proposing an algorithm that alternates between diffusing seed vertices, thresholding, and random reseeding, achieving state-of-the-art cluster purity on standard benchmarks and running an order of magnitude faster than comparable methods.

In this work we propose a simple and easily parallelizable algorithm for multiway graph partitioning. The algorithm alternates between three basic components: diffusing seed vertices over the graph, thresholding the diffused seeds, and then randomly reseeding the thresholded clusters. We demonstrate experimentally that the proper combination of these ingredients leads to an algorithm that achieves state-of-the-art performance in terms of cluster purity on standard benchmarks datasets. Moreover, the algorithm runs an order of magnitude faster than the other algorithms that achieve comparable results in terms of accuracy. We also describe a coarsen, cluster and refine approach similar to GRACLUS and METIS that removes an additional order of magnitude from the runtime of our algorithm while still maintaining competitive accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes