DenMune: Density peak based clustering using mutual nearest neighbors
This addresses clustering challenges in data science for applications with complex data distributions, though it appears incremental as it builds on density-based methods.
The authors tackled the problem of clustering data with arbitrary shapes, varying densities, and unbalanced classes by introducing DenMune, a novel algorithm based on mutual nearest neighborhoods, which automatically detects noise and produces robust results across low and high-dimensional datasets.
Many clustering algorithms fail when clusters are of arbitrary shapes, of varying densities, or the data classes are unbalanced and close to each other, even in two dimensions. A novel clustering algorithm, DenMune is presented to meet this challenge. It is based on identifying dense regions using mutual nearest neighborhoods of size K, where K is the only parameter required from the user, besides obeying the mutual nearest neighbor consistency principle. The algorithm is stable for a wide range of values of K. Moreover, it is able to automatically detect and remove noise from the clustering process as well as detecting the target clusters. It produces robust results on various low and high-dimensional datasets relative to several known state-of-the-art clustering algorithms.