CVIVSep 16, 2025

Image Realness Assessment and Localization with Multimodal Features

arXiv:2509.13289v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the need for reliable realness assessment and localization to enhance the practical use and photorealism of AI-generated images, representing an incremental advancement.

The paper tackles the problem of quantifying perceptual realness in AI-generated images and identifying inconsistent regions, resulting in improved objective realness prediction performance and effective dense realness maps.

A reliable method of quantifying the perceptual realness of AI-generated images and identifying visually inconsistent regions is crucial for practical use of AI-generated images and for improving photorealism of generative AI via realness feedback during training. This paper introduces a framework that accomplishes both overall objective realness assessment and local inconsistency identification of AI-generated images using textual descriptions of visual inconsistencies generated by vision-language models trained on large datasets that serve as reliable substitutes for human annotations. Our results demonstrate that the proposed multimodal approach improves objective realness prediction performance and produces dense realness maps that effectively distinguish between realistic and unrealistic spatial regions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes