CVNADec 1, 2016

Monge's Optimal Transport Distance for Image Classification

arXiv:1612.00181v24 citations
Originality Synthesis-oriented
AI Analysis

This work addresses image classification challenges by proposing a novel similarity measure, though it appears incremental as it applies an existing mathematical concept to a new domain with specific algorithmic integration.

The paper tackles the problem of comparing images by introducing the Wasserstein distance from Monge's optimal transport problem, presenting an efficient numerical solution method and demonstrating its discriminatory power using a 1-Nearest Neighbour algorithm on the MNIST dataset, showing potential benefits over traditional distance metrics.

This paper focuses on a similarity measure, known as the Wasserstein distance, with which to compare images. The Wasserstein distance results from a partial differential equation (PDE) formulation of Monge's optimal transport problem. We present an efficient numerical solution method for solving Monge's problem. To demonstrate the measure's discriminatory power when comparing images, we use a $1$-Nearest Neighbour ($1$-NN) machine learning algorithm to illustrate the measure's potential benefits over other more traditional distance metrics and also the Tangent Space distance, designed to perform excellently on the well-known MNIST dataset. To our knowledge, the PDE formulation of the Wasserstein metric has not been presented for dealing with image comparison, nor has the Wasserstein distance been used within the $1$-nearest neighbour architecture.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes