MLAug 8, 2016

Boosting as a kernel-based method

arXiv:1608.02485v29 citations
AI Analysis

This work provides a theoretical and practical link between two major machine learning paradigms, potentially improving efficiency and applicability for researchers and practitioners in ML.

The paper tackles the problem of connecting boosting and kernel-based methods, showing that boosting with a weak linear learner is equivalent to estimation with a special boosting kernel, and generalizes this to a broad class of boosting approaches for more general weak learners, enabling fast hyperparameter tuning and applications like robust regression and classification.

Boosting combines weak (biased) learners to obtain effective learning algorithms for classification and prediction. In this paper, we show a connection between boosting and kernel-based methods, highlighting both theoretical and practical applications. In the context of $\ell_2$ boosting, we start with a weak linear learner defined by a kernel $K$. We show that boosting with this learner is equivalent to estimation with a special {\it boosting kernel} that depends on $K$, as well as on the regression matrix, noise variance, and hyperparameters. The number of boosting iterations is modeled as a continuous hyperparameter, and fit along with other parameters using standard techniques. We then generalize the boosting kernel to a broad new class of boosting approaches for more general weak learners, including those based on the $\ell_1$, hinge and Vapnik losses. The approach allows fast hyperparameter tuning for this general class, and has a wide range of applications, including robust regression and classification. We illustrate some of these applications with numerical examples on synthetic and real data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes