MLLGDGDSCOJul 27, 2020

Normal-bundle Bootstrap

arXiv:2007.13869v16 citations
Originality Incremental advance
AI Analysis

This method addresses data augmentation and statistical inference for datasets with geometric structure, but appears incremental as it builds on existing manifold learning and differential geometry concepts.

The paper tackles the problem of generating new data that preserves the geometric structure of a dataset, as described by the manifold distribution hypothesis, by introducing the normal-bundle bootstrap method, which decomposes the probability measure and constructs data via bootstrapping on learned density ridges.

Probabilistic models of data sets often exhibit salient geometric structure. Such a phenomenon is summed up in the manifold distribution hypothesis, and can be exploited in probabilistic learning. Here we present normal-bundle bootstrap (NBB), a method that generates new data which preserve the geometric structure of a given data set. Inspired by algorithms for manifold learning and concepts in differential geometry, our method decomposes the underlying probability measure into a marginalized measure on a learned data manifold and conditional measures on the normal spaces. The algorithm estimates the data manifold as a density ridge, and constructs new data by bootstrapping projection vectors and adding them to the ridge. We apply our method to the inference of density ridge and related statistics, and data augmentation to reduce overfitting.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes