CVLGIVDec 23, 2023

Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation

arXiv:2312.15289v318 citationsHas CodeICLR
Originality Incremental advance
AI Analysis

This addresses the need for more reliable and interpretable evaluation metrics in generative learning, particularly for researchers and practitioners working with diverse image datasets, though it is incremental as it builds on the Fréchet distance framework.

The paper tackles the problem of biased and dataset-dependent metrics for image generation by proposing the Fréchet Wavelet Distance (FWD), a domain-agnostic metric based on the Wavelet Packet Transform, which improves robustness to domain shifts and corruptions compared to existing metrics like FID and FD-DINOv2.

Modern metrics for generative learning like Fréchet Inception Distance (FID) and DINOv2-Fréchet Distance (FD-DINOv2) demonstrate impressive performance. However, they suffer from various shortcomings, like a bias towards specific generators and datasets. To address this problem, we propose the Fréchet Wavelet Distance (FWD) as a domain-agnostic metric based on the Wavelet Packet Transform ($W_p$). FWD provides a sight across a broad spectrum of frequencies in images with a high resolution, preserving both spatial and textural aspects. Specifically, we use $W_p$ to project generated and real images to the packet coefficient space. We then compute the Fréchet distance with the resultant coefficients to evaluate the quality of a generator. This metric is general-purpose and dataset-domain agnostic, as it does not rely on any pre-trained network, while being more interpretable due to its ability to compute Fréchet distance per packet, enhancing transparency. We conclude with an extensive evaluation of a wide variety of generators across various datasets that the proposed FWD can generalize and improve robustness to domain shifts and various corruptions compared to other metrics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes