LGMLFeb 7, 2019

Bounds for the VC Dimension of 1NN Prototype Sets

arXiv:1902.02660v1
Originality Synthesis-oriented
AI Analysis

This work addresses a theoretical gap in statistical learning for researchers, providing foundational insights into sample complexity for 1NN classifiers, though it is incremental as it builds on scattered prior results.

The paper tackles the lack of theoretical bounds for the VC dimension of 1NN classifiers with fixed-size prototype sets by collecting existing results to derive explicit lower and upper bounds, and introduces a new lower bound for the two-dimensional case using a geometrical argument.

In Statistical Learning, the Vapnik-Chervonenkis (VC) dimension is an important combinatorial property of classifiers. To our knowledge, no theoretical results yet exist for the VC dimension of edited nearest-neighbour (1NN) classifiers with reference set of fixed size. Related theoretical results are scattered in the literature and their implications have not been made explicit. We collect some relevant results and use them to provide explicit lower and upper bounds for the VC dimension of 1NN classifiers with a prototype set of fixed size. We discuss the implications of these bounds for the size of training set needed to learn such a classifier to a given accuracy. Further, we provide a new lower bound for the two-dimensional case, based on a new geometrical argument.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes