LGJul 11, 2015

A new boosting algorithm based on dual averaging scheme

arXiv:1507.03125v11 citations

Originality Incremental advance

AI Analysis

This work addresses generalization issues in boosting for machine learning practitioners, but it is incremental as it builds on existing optimization techniques.

The authors tackled the problem of poor generalization in boosting algorithms by developing DABoost, a new method based on a dual-averaging scheme, and showed that it achieves better generalization error than AdaBoost, although it is slower in reducing training error.

The fields of machine learning and mathematical optimization increasingly intertwined. The special topic on supervised learning and convex optimization examines this interplay. The training part of most supervised learning algorithms can usually be reduced to an optimization problem that minimizes a loss between model predictions and training data. While most optimization techniques focus on accuracy and speed of convergence, the qualities of good optimization algorithm from the machine learning perspective can be quite different since machine learning is more than fitting the data. Better optimization algorithms that minimize the training loss can possibly give very poor generalization performance. In this paper, we examine a particular kind of machine learning algorithm, boosting, whose training process can be viewed as functional coordinate descent on the exponential loss. We study the relation between optimization techniques and machine learning by implementing a new boosting algorithm. DABoost, based on dual-averaging scheme and study its generalization performance. We show that DABoost, although slower in reducing the training error, in general enjoys a better generalization error than AdaBoost.

View on arXiv PDF

Similar