LGDSCOMLJul 24, 2017

Engineering fast multilevel support vector machines

arXiv:1707.07657v310 citationsHas Code
AI Analysis

This work addresses performance bottlenecks in SVM training for machine learning practitioners dealing with large datasets, though it is incremental as it builds on existing multilevel methods.

The paper tackles the high computational cost of nonlinear support vector machines (SVMs) on large-scale data, especially with imbalanced classes, by introducing a fast multilevel framework that achieves significant speed up compared to state-of-the-art libraries.

The computational complexity of solving nonlinear support vector machine (SVM) is prohibitive on large-scale data. In particular, this issue becomes very sensitive when the data represents additional difficulties such as highly imbalanced class sizes. Typically, nonlinear kernels produce significantly higher classification quality to linear kernels but introduce extra kernel and model parameters which requires computationally expensive fitting. This increases the quality but also reduces the performance dramatically. We introduce a generalized fast multilevel framework for regular and weighted SVM and discuss several versions of its algorithmic components that lead to a good trade-off between quality and time. Our framework is implemented using PETSc which allows an easy integration with scientific computing tasks. The experimental results demonstrate significant speed up compared to the state-of-the-art nonlinear SVM libraries. Reproducibility: our source code, documentation and parameters are available at https:// github.com/esadr/mlsvm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes