LGMay 1, 2022

Uniform Manifold Approximation with Two-phase Optimization

arXiv:2205.00420v222 citationsh-index: 27
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in dimensionality reduction for data analysis, offering an incremental improvement over existing methods.

The paper tackles the problem of improving UMAP to better capture global structure in dimensionality reduction, resulting in a method that outperforms widely used techniques in global structure preservation while maintaining competitive local structure accuracy.

We introduce Uniform Manifold Approximation with Two-phase Optimization (UMATO), a dimensionality reduction (DR) technique that improves UMAP to capture the global structure of high-dimensional data more accurately. In UMATO, optimization is divided into two phases so that the resulting embeddings can depict the global structure reliably while preserving the local structure with sufficient accuracy. In the first phase, hub points are identified and projected to construct a skeletal layout for the global structure. In the second phase, the remaining points are added to the embedding preserving the regional characteristics of local areas. Through quantitative experiments, we found that UMATO (1) outperformed widely used DR techniques in preserving the global structure while (2) producing competitive accuracy in representing the local structure. We also verified that UMATO is preferable in terms of robustness over diverse initialization methods, number of epochs, and subsampling techniques.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes