CV · Nov 24, 2025

Q-Save: Towards Scoring and Attribution for Generated Video Evaluation

arXiv:2511.18825v2 · 2 citations
Originality: Incremental advance
AI Analysis

This work addresses the need for systematic, holistic evaluation of AI-generated videos. The advance is incremental: it builds on existing datasets and methods by integrating multiple evaluation dimensions into one benchmark and model.

The paper tackles the problem of evaluating AI-generated video quality by introducing Q-Save, a benchmark dataset with nearly 10,000 annotated videos and a unified model that achieves superior performance in quality prediction while providing interpretable justifications.

Evaluating AI-generated video (AIGV) quality hinges on three crucial dimensions: visual quality, dynamic quality, and text-video alignment. While numerous evaluation datasets and algorithms have been proposed, existing approaches are constrained by two limitations: the absence of systematic definitions for evaluation dimensions, and the isolated treatment of the three dimensions in separate models. Therefore, we introduce Q-Save, a holistic benchmark dataset and unified evaluation model for AIGV quality assessment. The Q-Save dataset contains nearly 10,000 video samples, each annotated with Mean Opinion Scores (MOS) and fine-grained attribution explanations across the three core dimensions. Leveraging this attribution-annotated dataset, we train the proposed Q-Save model, which adopts the SlowFast framework to balance accuracy and efficiency. To jointly perform quality scoring and attribution generation, the model employs a three-stage training strategy with Chain-of-Thought (CoT) formatted data: Supervised Fine-Tuning (SFT), Group Relative Policy Optimization (GRPO), and a final SFT round for stability. Experimental results demonstrate that Q-Save achieves superior performance in AIGV quality prediction while providing interpretable justifications. Code and dataset will be released upon publication.
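The GRPO stage named in the abstract rests on one idea: several responses are sampled for the same input, and each response's reward is normalized against the rest of its group. The sketch below illustrates only that normalization step. The reward function, the 1-5 score scale, the example MOS value, and the use of the population standard deviation are all assumptions made for illustration; the abstract does not describe the paper's actual reward design.

```python
from statistics import mean, pstdev


def score_reward(predicted, mos, scale=4.0):
    """Hypothetical reward: 1.0 when the predicted score equals the MOS,
    falling linearly to 0.0 as the absolute error approaches `scale`."""
    return max(0.0, 1.0 - abs(predicted - mos) / scale)


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and standard deviation of its
    own sampling group, the group-relative step at the core of GRPO."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled score predictions for one video with a made-up MOS of 3.0
# on an assumed 1-5 scale.
predictions = [3.0, 2.0, 4.0, 5.0]
rewards = [score_reward(p, mos=3.0) for p in predictions]
advantages = group_relative_advantages(rewards)

for p, r, a in zip(predictions, rewards, advantages):
    print(f"prediction={p:.1f} reward={r:.2f} advantage={a:+.3f}")
```

Responses that score closer to the MOS than their group average receive positive advantages and are reinforced, while those further away receive negative ones, so no separate value model is needed to provide a baseline.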

