LGAIDSMLSep 15, 2020

Optimal Decision Trees for Nonlinear Metrics

arXiv:2009.06921v229 citations
Originality Highly original
AI Analysis

This addresses the need for better decision trees in imbalanced classification tasks, offering a novel solution for nonlinear metrics, though it is incremental in extending optimal tree methods to new criteria.

The paper tackles the problem of optimizing decision trees for nonlinear metrics like F1-score, which are challenging for existing methods focused on linear criteria, and proposes a bi-objective optimization algorithm that computes provably optimal trees, with experiments showing reasonable runtimes for most datasets.

Nonlinear metrics, such as the F1-score, Matthews correlation coefficient, and Fowlkes-Mallows index, are often used to evaluate the performance of machine learning models, in particular, when facing imbalanced datasets that contain more samples of one class than the other. Recent optimal decision tree algorithms have shown remarkable progress in producing trees that are optimal with respect to linear criteria, such as accuracy, but unfortunately nonlinear metrics remain a challenge. To address this gap, we propose a novel algorithm based on bi-objective optimisation, which treats misclassifications of each binary class as a separate objective. We show that, for a large class of metrics, the optimal tree lies on the Pareto frontier. Consequently, we obtain the optimal tree by using our method to generate the set of all nondominated trees. To the best of our knowledge, this is the first method to compute provably optimal decision trees for nonlinear metrics. Our approach leads to a trade-off when compared to optimising linear metrics: the resulting trees may be more desirable according to the given nonlinear metric at the expense of higher runtimes. Nevertheless, the experiments illustrate that runtimes are reasonable for majority of the tested datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes