LGMar 7, 2025

Near-Polynomially Competitive Active Logistic Regression

arXiv:2503.05981v44.1h-index: 1Has CodeAISTATS

Originality Highly original

AI Analysis

This addresses the challenge of reducing label queries in active learning for logistic regression, offering a near-optimal solution that is incremental in improving competitiveness.

The paper tackles the problem of active logistic regression in the realizable setting by presenting the first algorithm that is polynomially competitive with the optimal algorithm on every input instance, achieving label complexity polylogarithmic in error and domain size, with experiments showing performance gains.

We address the problem of active logistic regression in the realizable setting. It is well known that active learning can require exponentially fewer label queries compared to passive learning, in some cases using $\log \frac{1}{\eps}$ rather than $\poly(1/\eps)$ labels to get error $\eps$ larger than the optimum. We present the first algorithm that is polynomially competitive with the optimal algorithm on every input instance, up to factors polylogarithmic in the error and domain size. In particular, if any algorithm achieves label complexity polylogarithmic in $\eps$, so does ours. Our algorithm is based on efficient sampling and can be extended to learn more general class of functions. We further support our theoretical results with experiments demonstrating performance gains for logistic regression compared to existing active learning algorithms.

View on arXiv PDF Code

Similar