LGMLOct 11, 2020

ADABOOK & MULTIBOOK: Adaptive Boosting with Chance Correction

arXiv:2010.15550v11 citations
Originality Incremental advance
AI Analysis

This addresses a specific issue in machine learning evaluation for researchers and practitioners, but it is incremental as it builds on existing boosting and chance-corrected measure work.

The paper tackles the problem of boosting algorithms surrendering early when optimizing accuracy, which harms performance on chance-corrected measures like Kappa, by proposing AdaBook and Multibook that optimize these measures, showing they can beat standard AdaBoost and MultiBoost in multiclass situations.

There has been considerable interest in boosting and bagging, including the combination of the adaptive techniques of AdaBoost with the random selection with replacement techniques of Bagging. At the same time there has been a revisiting of the way we evaluate, with chance-corrected measures like Kappa, Informedness, Correlation or ROC AUC being advocated. This leads to the question of whether learning algorithms can do better by optimizing an appropriate chance corrected measure. Indeed, it is possible for a weak learner to optimize Accuracy to the detriment of the more reaslistic chance-corrected measures, and when this happens the booster can give up too early. This phenomenon is known to occur with conventional Accuracy-based AdaBoost, and the MultiBoost algorithm has been developed to overcome such problems using restart techniques based on bagging. This paper thus complements the theoretical work showing the necessity of using chance-corrected measures for evaluation, with empirical work showing how use of a chance-corrected measure can improve boosting. We show that the early surrender problem occurs in MultiBoost too, in multiclass situations, so that chance-corrected AdaBook and Multibook can beat standard Multiboost or AdaBoost, and we further identify which chance-corrected measures to use when.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes