CVMar 11

Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity

arXiv:2603.10990v128.71 citationsh-index: 26Has Code
Predicted impact top 16% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the issue of color bias in realistic image generation for users of text-to-image models, though it is incremental as it builds on existing evaluation and generation frameworks.

The paper tackles the problem of text-to-image models producing unrealistic colors in realistic-style images by introducing a dataset and metric for evaluating color fidelity, and a method to improve it, achieving enhanced color authenticity in generations.

Recent advances in text-to-image (T2I) generation have greatly improved visual quality, yet producing images that appear visually authentic to real-world photography remains challenging. This is partly due to biases in existing evaluation paradigms: human ratings and preference-trained metrics often favor visually vivid images with exaggerated saturation and contrast, which make generations often too vivid to be real even when prompted for realistic-style images. To address this issue, we present Color Fidelity Dataset (CFD) and Color Fidelity Metric (CFM) for objective evaluation of color fidelity in realistic-style generations. CFD contains over 1.3M real and synthetic images with ordered levels of color realism, while CFM employs a multimodal encoder to learn perceptual color fidelity. In addition, we propose a training-free Color Fidelity Refinement (CFR) that adaptively modulates spatial-temporal guidance scale in generation, thereby enhancing color authenticity. Together, CFD supports CFM for assessment, whose learned attention further guides CFR to refine T2I fidelity, forming a progressive framework for assessing and improving color fidelity in realistic-style T2I generation. The dataset and code are available at https://github.com/ZhengyaoFang/CFM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes