LGDGMLOct 16, 2024

Geometry-Aware Generative Autoencoders for Warped Riemannian Metric Learning and Generative Modeling on Data Manifolds

arXiv:2410.12779v420 citationsh-index: 17AISTATS
Originality Highly original
AI Analysis

This work addresses computational and statistical challenges in fields like single-cell RNA sequencing and spatial genomics by enabling better data generation and interpolation on data manifolds, representing a novel method for a known bottleneck.

The paper tackled the challenge of geometry-aware data generation and interpolation on high-dimensional datasets by introducing the Geometry-Aware Generative Autoencoder (GAGA), which constructs a neural network embedding space with a warped Riemannian metric, resulting in a 30% improvement over state-of-the-art methods in single-cell population-level trajectory inference.

Rapid growth of high-dimensional datasets in fields such as single-cell RNA sequencing and spatial genomics has led to unprecedented opportunities for scientific discovery, but it also presents unique computational and statistical challenges. Traditional methods struggle with geometry-aware data generation, interpolation along meaningful trajectories, and transporting populations via feasible paths. To address these issues, we introduce Geometry-Aware Generative Autoencoder (GAGA), a novel framework that combines extensible manifold learning with generative modeling. GAGA constructs a neural network embedding space that respects the intrinsic geometries discovered by manifold learning and learns a novel warped Riemannian metric on the data space. This warped metric is derived from both the points on the data manifold and negative samples off the manifold, allowing it to characterize a meaningful geometry across the entire latent space. Using this metric, GAGA can uniformly sample points on the manifold, generate points along geodesics, and interpolate between populations across the learned manifold using geodesic-guided flows. GAGA shows competitive performance in simulated and real-world datasets, including a 30% improvement over the state-of-the-art methods in single-cell population-level trajectory inference.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes