Designing and building the mlpack open-source machine learning library
This library provides a fast and flexible tool for machine learning practitioners, though it is incremental as it builds upon existing open-source library concepts.
The paper presents mlpack, an open-source C++ machine learning library focused on speed and flexibility, which implements a wide range of algorithms from standard to cutting-edge techniques and demonstrates superior performance in benchmarks compared to other libraries.
mlpack is an open-source C++ machine learning library with an emphasis on speed and flexibility. Since its original inception in 2007, it has grown to be a large project implementing a wide variety of machine learning algorithms, from standard techniques such as decision trees and logistic regression to modern techniques such as deep neural networks as well as other recently-published cutting-edge techniques not found in any other library. mlpack is quite fast, with benchmarks showing mlpack outperforming other libraries' implementations of the same methods. mlpack has an active community, with contributors from around the world---including some from PUST. This short paper describes the goals and design of mlpack, discusses how the open-source community functions, and shows an example usage of mlpack for a simple data science problem.