ML LG STJun 13, 2020

Sample complexity and effective dimension for regression on manifolds

Andrew McRae, Justin Romberg, Mark Davenport

arXiv:2006.07642v310.910 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of understanding dimensionality reduction in manifold-based machine learning problems, providing theoretical insights for practitioners, though it appears incremental by building on existing kernel methods and geometry.

The paper tackles the problem of regression on manifolds by establishing a nonasymptotic Weyl law, showing that smooth function spaces are effectively finite-dimensional with complexity scaling by manifold dimension, and proving that kernel regression achieves minimax-optimal error bounds controlled by effective dimension.

We consider the theory of regression on a manifold using reproducing kernel Hilbert space methods. Manifold models arise in a wide variety of modern machine learning problems, and our goal is to help understand the effectiveness of various implicit and explicit dimensionality-reduction methods that exploit manifold structure. Our first key contribution is to establish a novel nonasymptotic version of the Weyl law from differential geometry. From this we are able to show that certain spaces of smooth functions on a manifold are effectively finite-dimensional, with a complexity that scales according to the manifold dimension rather than any ambient data dimension. Finally, we show that given (potentially noisy) function values taken uniformly at random over a manifold, a kernel regression estimator (derived from the spectral decomposition of the manifold) yields minimax-optimal error bounds that are controlled by the effective dimension.

View on arXiv PDF

Similar