LGDec 24, 2022

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

arXiv:2212.12631v21.8h-index: 18

Originality Highly original

AI Analysis

This addresses a long-standing open problem in computational learning theory by providing a quadratic improvement, though it remains incremental as the optimal linear bound is still unresolved.

The paper tackles the problem of constructing a labelled sample compression scheme for finite concept classes, achieving a size bound of O(VCD^2), which substantially improves the previous exponential bound in Vapnik-Chervonenkis Dimension.

This paper presents a construction of a proper and stable labelled sample compression scheme of size $O(\VCD^2)$ for any finite concept class, where $\VCD$ denotes the Vapnik-Chervonenkis Dimension. The construction is based on a well-known model of machine teaching, referred to as recursive teaching dimension. This substantially improves on the currently best known bound on the size of sample compression schemes (due to Moran and Yehudayoff), which is exponential in $\VCD$. The long-standing open question whether the smallest size of a sample compression scheme is in $O(\VCD)$ remains unresolved, but our results show that research on machine teaching is a promising avenue for the study of this open problem. As further evidence of the strong connections between machine teaching and sample compression, we prove that the model of no-clash teaching, introduced by Kirkpatrick et al., can be used to define a non-trivial lower bound on the size of stable sample compression schemes.

View on arXiv PDF

Similar