CVAIAug 4, 2025

Refine-IQA: Multi-Stage Reinforcement Finetuning for Perceptual Image Quality Assessment

arXiv:2508.03763v23 citationsh-index: 49
Originality Incremental advance
AI Analysis

This work addresses image quality assessment for computer vision applications, presenting an incremental improvement over existing reinforcement fine-tuning methods.

The paper tackled the problem of improving image quality assessment (IQA) by addressing gaps in reinforcement fine-tuning methods, such as lack of supervision for the 'think' process and insufficient enhancement of low-level visual perception, resulting in models that achieve outstanding performance on perception and scoring tasks, with notable results on a quality interpreting benchmark.

Reinforcement fine-tuning (RFT) is a proliferating paradigm for LMM training. Analogous to high-level reasoning tasks, RFT is similarly applicable to low-level vision domains, including image quality assessment (IQA). Existing RFT-based IQA methods typically use rule-based output rewards to verify the model's rollouts but provide no reward supervision for the "think" process, leaving its correctness and efficacy uncontrolled. Furthermore, these methods typically fine-tune directly on downstream IQA tasks without explicitly enhancing the model's native low-level visual quality perception, which may constrain its performance upper bound. In response to these gaps, we propose the multi-stage RFT IQA framework (Refine-IQA). In Stage-1, we build the Refine-Perception-20K dataset (with 12 main distortions, 20,907 locally-distorted images, and over 55K RFT samples) and design multi-task reward functions to strengthen the model's visual quality perception. In Stage-2, targeting the quality scoring task, we introduce a probability difference reward involved strategy for "think" process supervision. The resulting Refine-IQA Series Models achieve outstanding performance on both perception and scoring tasks-and, notably, our paradigm activates a robust "think" (quality interpreting) capability that also attains exceptional results on the corresponding quality interpreting benchmark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes