CC DS LG MLSep 17, 2024

The Sample Complexity of Smooth Boosting and the Tightness of the Hardcore Theorem

Guy Blanc, Alexandre Hayderi, Caleb Koch, Li-Yang Tan

arXiv:2409.11597v12.33 citationsh-index: 9

Originality Incremental advance

AI Analysis

This work addresses fundamental questions in computational learning theory and complexity theory, providing tight bounds for smooth boosting and the hardcore theorem, which are incremental but important for understanding limitations in these areas.

The paper settles the sample complexity of smooth boosting, showing that strong learning over uniform distributions requires a quadratic overhead in samples compared to weak learning over smooth distributions, matching existing boosters and separating from distribution-independent boosting. It also proves that the circuit size loss in Impagliazzo's hardcore theorem is necessary and optimal.

Smooth boosters generate distributions that do not place too much weight on any given example. Originally introduced for their noise-tolerant properties, such boosters have also found applications in differential privacy, reproducibility, and quantum learning theory. We study and settle the sample complexity of smooth boosting: we exhibit a class that can be weak learned to $γ$-advantage over smooth distributions with $m$ samples, for which strong learning over the uniform distribution requires $\tildeΩ(1/γ^2)\cdot m$ samples. This matches the overhead of existing smooth boosters and provides the first separation from the setting of distribution-independent boosting, for which the corresponding overhead is $O(1/γ)$. Our work also sheds new light on Impagliazzo's hardcore theorem from complexity theory, all known proofs of which can be cast in the framework of smooth boosting. For a function $f$ that is mildly hard against size-$s$ circuits, the hardcore theorem provides a set of inputs on which $f$ is extremely hard against size-$s'$ circuits. A downside of this important result is the loss in circuit size, i.e. that $s' \ll s$. Answering a question of Trevisan, we show that this size loss is necessary and in fact, the parameters achieved by known proofs are the best possible.

View on arXiv PDF

Similar