LG MLFeb 20, 2024

Statistical curriculum learning: An elimination algorithm achieving an oracle risk

arXiv:2402.13366v12.6h-index: 3COLT

Originality Incremental advance

AI Analysis

This work addresses the challenge of efficient parameter estimation in statistical learning with adaptive sampling, offering incremental improvements in curriculum learning methods.

The paper tackles the problem of curriculum learning in a parametric prediction setting by proposing an elimination algorithm that adaptively samples from target and source models to estimate a parameter vector, achieving a risk that matches an oracle benchmark in single-source cases and characterizing conditions for matching a weak-oracle benchmark in multiple-source cases.

We consider a statistical version of curriculum learning (CL) in a parametric prediction setting. The learner is required to estimate a target parameter vector, and can adaptively collect samples from either the target model, or other source models that are similar to the target model, but less noisy. We consider three types of learners, depending on the level of side-information they receive. The first two, referred to as strong/weak-oracle learners, receive high/low degrees of information about the models, and use these to learn. The third, a fully adaptive learner, estimates the target parameter vector without any prior information. In the single source case, we propose an elimination learning method, whose risk matches that of a strong-oracle learner. In the multiple source case, we advocate that the risk of the weak-oracle learner is a realistic benchmark for the risk of adaptive learners. We develop an adaptive multiple elimination-rounds CL algorithm, and characterize instance-dependent conditions for its risk to match that of the weak-oracle learner. We consider instance-dependent minimax lower bounds, and discuss the challenges associated with defining the class of instances for the bound. We derive two minimax lower bounds, and determine the conditions under which the performance weak-oracle learner is minimax optimal.

View on arXiv PDF

Similar