CVJul 27, 2022

Shift-tolerant Perceptual Similarity Metric

arXiv:2207.13686v165 citationsh-index: 59
Originality Incremental advance
AI Analysis

This addresses a specific issue in computer vision for researchers and practitioners using similarity metrics, but it is incremental as it builds on an existing method.

The paper tackles the problem of perceptual similarity metrics being sensitive to small alignment errors by developing a shift-tolerant metric based on LPIPS, which is robust to imperceptible shifts while maintaining consistency with human judgment.

Existing perceptual similarity metrics assume an image and its reference are well aligned. As a result, these metrics are often sensitive to a small alignment error that is imperceptible to the human eyes. This paper studies the effect of small misalignment, specifically a small shift between the input and reference image, on existing metrics, and accordingly develops a shift-tolerant similarity metric. This paper builds upon LPIPS, a widely used learned perceptual similarity metric, and explores architectural design considerations to make it robust against imperceptible misalignment. Specifically, we study a wide spectrum of neural network elements, such as anti-aliasing filtering, pooling, striding, padding, and skip connection, and discuss their roles in making a robust metric. Based on our studies, we develop a new deep neural network-based perceptual similarity metric. Our experiments show that our metric is tolerant to imperceptible shifts while being consistent with the human similarity judgment.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes