LG OC MLOct 31, 2024

Sample-Efficient Agnostic Boosting

arXiv:2410.23632v16.42 citationsh-index: 7NIPS

Originality Incremental advance

AI Analysis

This work addresses a key bottleneck in machine learning by improving the sample efficiency of agnostic boosting, which is important for applications where data is limited, though it is incremental in closing the gap to ERM.

The paper tackles the sample inefficiency of agnostic boosting algorithms compared to Empirical Risk Minimization (ERM) by developing a new algorithm that is substantially more sample efficient without increasing computational complexity, leveraging sample reuse across boosting rounds to achieve better generalization error.

The theory of boosting provides a computational framework for aggregating approximate weak learning algorithms, which perform marginally better than a random predictor, into an accurate strong learner. In the realizable case, the success of the boosting approach is underscored by a remarkable fact that the resultant sample complexity matches that of a computationally demanding alternative, namely Empirical Risk Minimization (ERM). This in particular implies that the realizable boosting methodology has the potential to offer computational relief without compromising on sample efficiency. Despite recent progress, in agnostic boosting, where assumptions on the conditional distribution of labels given feature descriptions are absent, ERM outstrips the agnostic boosting methodology in being quadratically more sample efficient than all known agnostic boosting algorithms. In this paper, we make progress on closing this gap, and give a substantially more sample efficient agnostic boosting algorithm than those known, without compromising on the computational (or oracle) complexity. A key feature of our algorithm is that it leverages the ability to reuse samples across multiple rounds of boosting, while guaranteeing a generalization error strictly better than those obtained by blackbox applications of uniform convergence arguments. We also apply our approach to other previously studied learning problems, including boosting for reinforcement learning, and demonstrate improved results.

View on arXiv PDF

Similar