CVSep 15, 2024

EditBoard: Towards a Comprehensive Evaluation Benchmark for Text-Based Video Editing Models

arXiv:2409.09668v26 citationsh-index: 1
AI Analysis

This addresses the problem of inconsistent and limited evaluations for researchers and developers in the text-based video editing field, though it is incremental as it builds on existing evaluation practices.

The authors tackled the lack of comprehensive evaluation benchmarks for text-based video editing models by proposing EditBoard, which includes nine automatic metrics across four dimensions and four task categories, resulting in a standardized framework for objective assessment.

The rapid development of diffusion models has significantly advanced AI-generated content (AIGC), particularly in Text-to-Image (T2I) and Text-to-Video (T2V) generation. Text-based video editing, leveraging these generative capabilities, has emerged as a promising field, enabling precise modifications to videos based on text prompts. Despite the proliferation of innovative video editing models, there is a conspicuous lack of comprehensive evaluation benchmarks that holistically assess these models' performance across various dimensions. Existing evaluations are limited and inconsistent, typically summarizing overall performance with a single score, which obscures models' effectiveness on individual editing tasks. To address this gap, we propose EditBoard, the first comprehensive evaluation benchmark for text-based video editing models. EditBoard encompasses nine automatic metrics across four dimensions, evaluating models on four task categories and introducing three new metrics to assess fidelity. This task-oriented benchmark facilitates objective evaluation by detailing model performance and providing insights into each model's strengths and weaknesses. By open-sourcing EditBoard, we aim to standardize evaluation and advance the development of robust video editing models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes