MLMay 20, 2017

Accelerated Hierarchical Density Clustering

arXiv:1705.07321v2503 citationsHas Code
AI Analysis

This provides an incremental improvement for users of density-based clustering algorithms by making accelerated HDBSCAN* a more efficient and practical default choice.

The paper tackles the problem of improving hierarchical density-based clustering by accelerating HDBSCAN*, achieving comparable performance to DBSCAN while supporting variable density clusters and eliminating the need for a difficult-to-tune distance scale parameter.

We present an accelerated algorithm for hierarchical density based clustering. Our new algorithm improves upon HDBSCAN*, which itself provided a significant qualitative improvement over the popular DBSCAN algorithm. The accelerated HDBSCAN* algorithm provides comparable performance to DBSCAN, while supporting variable density clusters, and eliminating the need for the difficult to tune distance scale parameter. This makes accelerated HDBSCAN* the default choice for density based clustering. Library available at: https://github.com/scikit-learn-contrib/hdbscan

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes