LGMLFeb 10, 2019

ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg"

arXiv:1902.03616v157 citationsHas Code
Originality Synthesis-oriented
AI Analysis

It provides a tool for researchers and students to easily implement and benchmark algorithms in data mining, but is incremental as a version update.

The paper documents the release of ELKI version 0.7.5, an open-source Java library focused on unsupervised data mining algorithms like cluster analysis and outlier detection, emphasizing high performance through data index structures and extensibility for research.

This paper documents the release of the ELKI data mining framework, version 0.7.5. ELKI is an open source (AGPLv3) data mining software written in Java. The focus of ELKI is research in algorithms, with an emphasis on unsupervised methods in cluster analysis and outlier detection. In order to achieve high performance and scalability, ELKI offers data index structures such as the R*-tree that can provide major performance gains. ELKI is designed to be easy to extend for researchers and students in this domain, and welcomes contributions of additional methods. ELKI aims at providing a large collection of highly parameterizable algorithms, in order to allow easy and fair evaluation and benchmarking of algorithms. We will first outline the motivation for this release, the plans for the future, and then give a brief overview over the new functionality in this version. We also include an appendix presenting an overview on the overall implemented functionality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes