LGMLSep 17, 2019

Global Optimal Path-Based Clustering Algorithm

arXiv:1909.07774v14 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the challenge of achieving global optimal clustering for diverse datasets, though it appears incremental as it builds on existing path-based and medoid concepts.

The paper tackles the NP-hard problem of finding global optimum solutions in clustering by proposing the GOPC algorithm, which can recognize clusters of various shapes, sizes, and densities on synthetic datasets and demonstrates effectiveness and efficiency on real-world datasets.

Combinatorial optimization problems for clustering are known to be NP-hard. Most optimization methods are not able to find the global optimum solution for all datasets. To solve this problem, we propose a global optimal path-based clustering (GOPC) algorithm in this paper. The GOPC algorithm is based on two facts: (1) medoids have the minimum degree in their clusters; (2) the minimax distance between two objects in one cluster is smaller than the minimax distance between objects in different clusters. Extensive experiments are conducted on synthetic and real-world datasets to evaluate the performance of the GOPC algorithm. The results on synthetic datasets show that the GOPC algorithm can recognize all kinds of clusters regardless of their shapes, sizes, or densities. Experimental results on real-world datasets demonstrate the effectiveness and efficiency of the GOPC algorithm. In addition, the GOPC algorithm needs only one parameter, i.e., the number of clusters, which can be estimated by the decision graph. The advantages mentioned above make GOPC a good candidate as a general clustering algorithm. Codes are available at https://github.com/Qidong-Liu/Clustering.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes