CVNov 22, 2024

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

arXiv:2411.15262v224 citationsh-index: 11CVPR
Originality Synthesis-oriented
AI Analysis

It addresses the problem of evaluating and training long video generation models for researchers, though it is incremental as it focuses on dataset creation rather than a new method.

The paper tackles the lack of datasets for long video generation by introducing MovieBench, a hierarchical movie-level dataset with multi-scene narratives and character consistency, which reveals new challenges like maintaining character ID across scenes.

Recent advancements in video generation models, like Stable Video Diffusion, show promising results, but primarily focus on short, single-scene videos. These models struggle with generating long videos that involve multiple scenes, coherent narratives, and consistent characters. Furthermore, there is no publicly available dataset tailored for the analysis, evaluation, and training of long video generation models. In this paper, we present MovieBench: A Hierarchical Movie-Level Dataset for Long Video Generation, which addresses these challenges by providing unique contributions: (1) movie-length videos featuring rich, coherent storylines and multi-scene narratives, (2) consistency of character appearance and audio across scenes, and (3) hierarchical data structure contains high-level movie information and detailed shot-level descriptions. Experiments demonstrate that MovieBench brings some new insights and challenges, such as maintaining character ID consistency across multiple scenes for various characters. The dataset will be public and continuously maintained, aiming to advance the field of long video generation. Data can be found at: https://weijiawu.github.io/MovieBench/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes