STITMLMar 28, 2016

Analysis of k-Nearest Neighbor Distances with Application to Entropy Estimation

arXiv:1603.08578v238 citations
Originality Incremental advance
AI Analysis

It addresses theoretical gaps for widely used entropy and mutual information estimators in machine learning, but is incremental as it builds on existing methods.

This paper tackles the problem of finite-sample behavior analysis for the Kozachenko-Leonenko (KL) entropy estimator, proving bounds on its bias and variance and showing it achieves the minimax convergence rate for certain smooth function classes.

Estimating entropy and mutual information consistently is important for many machine learning applications. The Kozachenko-Leonenko (KL) estimator (Kozachenko & Leonenko, 1987) is a widely used nonparametric estimator for the entropy of multivariate continuous random variables, as well as the basis of the mutual information estimator of Kraskov et al. (2004), perhaps the most widely used estimator of mutual information in this setting. Despite the practical importance of these estimators, major theoretical questions regarding their finite-sample behavior remain open. This paper proves finite-sample bounds on the bias and variance of the KL estimator, showing that it achieves the minimax convergence rate for certain classes of smooth functions. In proving these bounds, we analyze finite-sample behavior of k-nearest neighbors (k-NN) distance statistics (on which the KL estimator is based). We derive concentration inequalities for k-NN distances and a general expectation bound for statistics of k-NN distances, which may be useful for other analyses of k-NN methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes