LGGNJun 18, 2025

Global Ground Metric Learning with Applications to scRNA data

arXiv:2506.15383v1h-index: 6Has CodeAISTATS
Originality Highly original
AI Analysis

This addresses the need for more accurate and interpretable distance metrics in computational biology, particularly for analyzing single-cell RNA sequencing data across multiple diseases.

The paper tackles the problem of learning a ground metric for optimal transport that generalizes across classes and distributions, resulting in improved performance in embedding, clustering, and classification tasks on scRNA-seq data.

Optimal transport provides a robust framework for comparing probability distributions. Its effectiveness is significantly influenced by the choice of the underlying ground metric. Traditionally, the ground metric has either been (i) predefined, e.g., as the Euclidean distance, or (ii) learned in a supervised way, by utilizing labeled data to learn a suitable ground metric for enhanced task-specific performance. Yet, predefined metrics typically cannot account for the inherent structure and varying importance of different features in the data, and existing supervised approaches to ground metric learning often do not generalize across multiple classes or are restricted to distributions with shared supports. To address these limitations, we propose a novel approach for learning metrics for arbitrary distributions over a shared metric space. Our method provides a distance between individual points like a global metric, but requires only class labels on a distribution-level for training. The learned global ground metric enables more accurate optimal transport distances, leading to improved performance in embedding, clustering and classification tasks. We demonstrate the effectiveness and interpretability of our approach using patient-level scRNA-seq data spanning multiple diseases.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes