LGCVGTJan 23

Understanding and Improving UMAP with Geometric and Topological Priors: The JORC-UMAP Algorithm

arXiv:2601.16552v1h-index: 1
Originality Highly original
AI Analysis

This work provides a geometry-aware enhancement to UMAP for more faithful data visualization, addressing a specific bottleneck in nonlinear dimensionality reduction.

The paper tackled UMAP's limitations in capturing manifold geometry by introducing JORC-UMAP, which uses geometric and topological priors to reduce tearing and collapse, resulting in improved SVM accuracy and triplet preservation scores on synthetic and real-world datasets.

Nonlinear dimensionality reduction techniques, particularly UMAP, are widely used for visualizing high-dimensional data. However, UMAP's local Euclidean distance assumption often fails to capture intrinsic manifold geometry, leading to topological tearing and structural collapse. We identify UMAP's sensitivity to the k-nearest neighbor graph as a key cause. To address this, we introduce Ollivier-Ricci curvature as a geometric prior, reinforcing edges at geometric bottlenecks and reducing redundant links. Since curvature estimation is noise-sensitive, we also incorporate a topological prior using Jaccard similarity to ensure neighborhood consistency. The resulting method, JORC-UMAP, better distinguishes true manifold structure from spurious connections. Experiments on synthetic and real-world datasets show that JORC-UMAP reduces tearing and collapse more effectively than standard UMAP and other DR methods, as measured by SVM accuracy and triplet preservation scores, while maintaining computational efficiency. This work offers a geometry-aware enhancement to UMAP for more faithful data visualization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes