IVCVFeb 5, 2024

Rethinking RGB Color Representation for Image Restoration Models

arXiv:2402.03399v16 citationsh-index: 14
AI Analysis

This work addresses a specific bottleneck in image restoration for computer vision applications, offering an incremental improvement over existing methods.

The authors tackled the problem of blurry and unrealistic textures in image restoration models by augmenting the RGB color representation to include structural information, resulting in improved performance without affecting evaluation. Notably, this approach consistently enhanced overall metrics, such as overcoming the perception-distortion trade-off when combined with auxiliary objectives.

Image restoration models are typically trained with a pixel-wise distance loss defined over the RGB color representation space, which is well known to be a source of blurry and unrealistic textures in the restored images. The reason, we believe, is that the three-channel RGB space is insufficient for supervising the restoration models. To this end, we augment the representation to hold structural information of local neighborhoods at each pixel while keeping the color information and pixel-grainedness unharmed. The result is a new representation space, dubbed augmented RGB ($a$RGB) space. Substituting the underlying representation space for the per-pixel losses facilitates the training of image restoration models, thereby improving the performance without affecting the evaluation phase. Notably, when combined with auxiliary objectives such as adversarial or perceptual losses, our $a$RGB space consistently improves overall metrics by reconstructing both color and local structures, overcoming the conventional perception-distortion trade-off.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes