LGJun 19, 2024

Scalable unsupervised alignment of general metric and non-metric structures

arXiv:2406.13507v12 citations
Originality Incremental advance
AI Analysis

This addresses a fundamental alignment problem in machine learning with applications in areas like single-cell multiomics, though it appears incremental as it builds on prior work by improving scalability and flexibility.

The paper tackles the problem of aligning data from different domains, such as single-cell multiomics, by formulating it as a quadratic assignment problem and proposing a method to learn a scalable linear assignment problem that minimizes it, achieving state-of-the-art performance on synthetic and real datasets.

Aligning data from different domains is a fundamental problem in machine learning with broad applications across very different areas, most notably aligning experimental readouts in single-cell multiomics. Mathematically, this problem can be formulated as the minimization of disagreement of pair-wise quantities such as distances and is related to the Gromov-Hausdorff and Gromov-Wasserstein distances. Computationally, it is a quadratic assignment problem (QAP) that is known to be NP-hard. Prior works attempted to solve the QAP directly with entropic or low-rank regularization on the permutation, which is computationally tractable only for modestly-sized inputs, and encode only limited inductive bias related to the domains being aligned. We consider the alignment of metric structures formulated as a discrete Gromov-Wasserstein problem and instead of solving the QAP directly, we propose to learn a related well-scalable linear assignment problem (LAP) whose solution is also a minimizer of the QAP. We also show a flexible extension of the proposed framework to general non-metric dissimilarities through differentiable ranks. We extensively evaluate our approach on synthetic and real datasets from single-cell multiomics and neural latent spaces, achieving state-of-the-art performance while being conceptually and computationally simple.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes