IVCVJul 6, 2021

A Theory of the Distortion-Perception Tradeoff in Wasserstein Space

arXiv:2107.02555v165 citations
Originality Highly original
AI Analysis

This provides a theoretical foundation for the perception-distortion tradeoff in image restoration, addressing an open question in the field.

The paper tackles the problem of quantifying the minimal distortion achievable under a given perception constraint in image restoration, deriving a closed-form quadratic expression for the distortion-perception function using MSE distortion and Wasserstein-2 perception index, with results applicable to Gaussian and general distributions.

The lower the distortion of an estimator, the more the distribution of its outputs generally deviates from the distribution of the signals it attempts to estimate. This phenomenon, known as the perception-distortion tradeoff, has captured significant attention in image restoration, where it implies that fidelity to ground truth images comes at the expense of perceptual quality (deviation from statistics of natural images). However, despite the increasing popularity of performing comparisons on the perception-distortion plane, there remains an important open question: what is the minimal distortion that can be achieved under a given perception constraint? In this paper, we derive a closed form expression for this distortion-perception (DP) function for the mean squared-error (MSE) distortion and the Wasserstein-2 perception index. We prove that the DP function is always quadratic, regardless of the underlying distribution. This stems from the fact that estimators on the DP curve form a geodesic in Wasserstein space. In the Gaussian setting, we further provide a closed form expression for such estimators. For general distributions, we show how these estimators can be constructed from the estimators at the two extremes of the tradeoff: The global MSE minimizer, and a minimizer of the MSE under a perfect perceptual quality constraint. The latter can be obtained as a stochastic transformation of the former.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes