LGAIMLJul 14, 2021

MESS: Manifold Embedding Motivated Super Sampling

arXiv:2107.06566v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses data scarcity issues in high-dimensional manifold learning, though it appears incremental as it builds on existing manifold assumptions.

The paper tackles the curse of dimensionality in manifold-based machine learning by proposing a framework to generate virtual data points that faithfully approximate the manifold embedding, aiming to reduce the need for large local data densities.

Many approaches in the field of machine learning and data analysis rely on the assumption that the observed data lies on lower-dimensional manifolds. This assumption has been verified empirically for many real data sets. To make use of this manifold assumption one generally requires the manifold to be locally sampled to a certain density such that features of the manifold can be observed. However, for increasing intrinsic dimensionality of a data set the required data density introduces the need for very large data sets, resulting in one of the many faces of the curse of dimensionality. To combat the increased requirement for local data density we propose a framework to generate virtual data points that faithful to an approximate embedding function underlying the manifold observable in the data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes