CVApr 3

EvaNet: Towards More Efficient and Consistent Infrared and Visible Image Fusion Assessment

arXiv:2604.0289675.51 citationsh-index: 13Has Code
Predicted impact top 35% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the need for efficient and consistent assessment in image fusion research, which is incremental as it builds on existing metrics but introduces a novel framework.

The paper tackles the problem of evaluating infrared and visible image fusion by proposing a lightweight network that approximates traditional metrics, achieving up to 1,000 times faster speed and greater consistency with human perception.

Evaluation is essential in image fusion research, yet most existing metrics are directly borrowed from other vision tasks without proper adaptation. These traditional metrics, often based on complex image transformations, not only fail to capture the true quality of the fusion results but also are computationally demanding. To address these issues, we propose a unified evaluation framework specifically tailored for image fusion. At its core is a lightweight network designed efficiently to approximate widely used metrics, following a divide-and-conquer strategy. Unlike conventional approaches that directly assess similarity between fused and source images, we first decompose the fusion result into infrared and visible components. The evaluation model is then used to measure the degree of information preservation in these separated components, effectively disentangling the fusion evaluation process. During training, we incorporate a contrastive learning strategy and inform our evaluation model by perceptual scene assessment provided by a large language model. Last, we propose the first consistency evaluation framework, which measures the alignment between image fusion metrics and human visual perception, using both independent no-reference scores and downstream tasks performance as objective references. Extensive experiments show that our learning-based evaluation paradigm delivers both superior efficiency (up to 1,000 times faster) and greater consistency across a range of standard image fusion benchmarks. Our code will be publicly available at https://github.com/AWCXV/EvaNet.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes