CVAILGNov 25, 2023

Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation

arXiv:2311.15100v233 citationsh-index: 33
Originality Incremental advance
AI Analysis

This addresses a bottleneck in domain translation for fields like biology and computer vision, though it is incremental as it builds on existing neural OT frameworks.

The paper tackled the problem of mass conservation in optimal transport for unpaired domain translation, which is prone to outliers and limits real-world applicability, by proposing a method to incorporate unbalancedness into neural Monge map estimators, resulting in improved performance in tasks like cell trajectory modeling and image translation, with UOT-FM better preserving relevant features.

In optimal transport (OT), a Monge map is known as a mapping that transports a source distribution to a target distribution in the most cost-efficient way. Recently, multiple neural estimators for Monge maps have been developed and applied in diverse unpaired domain translation tasks, e.g. in single-cell biology and computer vision. However, the classic OT framework enforces mass conservation, which makes it prone to outliers and limits its applicability in real-world scenarios. The latter can be particularly harmful in OT domain translation tasks, where the relative position of a sample within a distribution is explicitly taken into account. While unbalanced OT tackles this challenge in the discrete setting, its integration into neural Monge map estimators has received limited attention. We propose a theoretically grounded method to incorporate unbalancedness into any Monge map estimator. We improve existing estimators to model cell trajectories over time and to predict cellular responses to perturbations. Moreover, our approach seamlessly integrates with the OT flow matching (OT-FM) framework. While we show that OT-FM performs competitively in image translation, we further improve performance by incorporating unbalancedness (UOT-FM), which better preserves relevant features. We hence establish UOT-FM as a principled method for unpaired image translation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes