LG · MS · ML · Oct 24, 2018

CatBoost: gradient boosting with categorical features support

arXiv:1810.11363v1 · 1865 citations · Has Code
Originality: Incremental advance
AI Analysis

CatBoost gives machine learning practitioners who work with categorical data a more efficient and effective gradient boosting option, though the advance is incremental in nature.

The authors address gradient boosting with categorical features by introducing CatBoost, a new open-source library that outperforms existing publicly available implementations in quality on popular datasets and offers faster GPU training and CPU scoring on ensembles of similar size.

In this paper we present CatBoost, a new open-sourced gradient boosting library that successfully handles categorical features and outperforms existing publicly available implementations of gradient boosting in terms of quality on a set of popular publicly available datasets. The library has a GPU implementation of learning algorithm and a CPU implementation of scoring algorithm, which are significantly faster than other gradient boosting libraries on ensembles of similar sizes.
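The abstract does not say how categorical features are handled. One approach associated with CatBoost is to replace each categorical value with a smoothed target statistic computed only from earlier rows, so that a row's own label never leaks into its encoding. The following is a minimal sketch of that idea, not the library's implementation: the function name `ordered_target_statistics` and the `prior` and `weight` parameters are hypothetical, and the sketch uses the given row order where a real implementation would typically use a random permutation.

```python
from collections import defaultdict


def ordered_target_statistics(categories, targets, prior=0.5, weight=1.0):
    """Encode each categorical value using only the targets of earlier rows.

    `prior` and `weight` are illustrative smoothing parameters (assumptions,
    not names taken from the paper or the library).
    """
    sums = defaultdict(float)  # running sum of targets per category
    counts = defaultdict(int)  # running number of rows seen per category
    encoded = []
    for cat, y in zip(categories, targets):
        # Use statistics from rows seen so far; the current label is excluded.
        encoded.append((sums[cat] + weight * prior) / (counts[cat] + weight))
        # Only now fold the current row into the running statistics.
        sums[cat] += y
        counts[cat] += 1
    return encoded


cats = ["a", "b", "a", "a", "b"]
ys = [1, 0, 0, 1, 1]
print(ordered_target_statistics(cats, ys))  # [0.5, 0.5, 0.75, 0.5, 0.25]
```

The first occurrence of each category falls back to the prior, and later occurrences move toward the mean target observed so far for that category.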

Code Implementations: 4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
