DSCCLGAug 19, 2023

Computing the Vapnik Chervonenkis Dimension for Non-Discrete Settings

arXiv:2308.10041v1h-index: 1
Originality Highly original
AI Analysis

This work addresses a foundational gap in computational learning theory by enabling VC dimension estimation for general settings, which is crucial for characterizing learnability in machine learning.

The paper tackles the problem of computing the Vapnik-Chervonenkis (VC) dimension in non-discrete settings, where previous algorithms were limited to finite domains, and presents a method using Empirical Risk Minimization (ERM) to approximate the VC dimension without constraints on concept classes or domain sets.

In 1984, Valiant [ 7 ] introduced the Probably Approximately Correct (PAC) learning framework for boolean function classes. Blumer et al. [ 2] extended this model in 1989 by introducing the VC dimension as a tool to characterize the learnability of PAC. The VC dimension was based on the work of Vapnik and Chervonenkis in 1971 [8 ], who introduced a tool called the growth function to characterize the shattering property. Researchers have since determined the VC dimension for specific classes, and efforts have been made to develop an algorithm that can calculate the VC dimension for any concept class. In 1991, Linial, Mansour, and Rivest [4] presented an algorithm for computing the VC dimension in the discrete setting, assuming that both the concept class and domain set were finite. However, no attempts had been made to design an algorithm that could compute the VC dimension in the general setting.Therefore, our work focuses on developing a method to approximately compute the VC dimension without constraints on the concept classes or their domain set. Our approach is based on our finding that the Empirical Risk Minimization (ERM) learning paradigm can be used as a new tool to characterize the shattering property of a concept class.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes