DSDMLGSINov 20, 2021

Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds

arXiv:2111.10699v228 citations
Originality Incremental advance
AI Analysis

This provides more scalable solutions for clustering tasks in data analysis, though it is incremental as it builds on existing special cases.

The paper tackles the impracticality of linear programming relaxations in correlation clustering by developing faster approximation algorithms for cluster editing and deletion, achieving near-optimal quality while scaling to much larger problems.

Correlation clustering is a widely studied framework for clustering based on pairwise similarity and dissimilarity scores, but its best approximation algorithms rely on impractical linear programming relaxations. We present faster approximation algorithms that avoid these relaxations, for two well-studied special cases: cluster editing and cluster deletion. We accomplish this by drawing new connections to edge labeling problems related to the principle of strong triadic closure. This leads to faster and more practical linear programming algorithms, as well as extremely scalable combinatorial techniques, including the first combinatorial approximation algorithm for cluster deletion. In practice, our algorithms produce approximate solutions that nearly match the best algorithms in quality, while scaling to problems that are orders of magnitude larger.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes