CVApr 16

AnimationBench: Are Video Models Good at Character-Centric Animation?

arXiv:2604.1529997.8h-index: 21
Predicted impact top 4% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This benchmark addresses the lack of evaluation tools for animation-style video generation, benefiting researchers and practitioners in animation and video generation.

AnimationBench is the first systematic benchmark for evaluating animation image-to-video generation, operationalizing animation principles and IP preservation into measurable dimensions. It aligns with human judgment and reveals animation-specific quality differences overlooked by realism-oriented benchmarks.

Video generation has advanced rapidly, with recent methods producing increasingly convincing animated results. However, existing benchmarks-largely designed for realistic videos-struggle to evaluate animation-style generation with its stylized appearance, exaggerated motion, and character-centric consistency. Moreover, they also rely on fixed prompt sets and rigid pipelines, offering limited flexibility for open-domain content and custom evaluation needs. To address this gap, we introduce AnimationBench, the first systematic benchmark for evaluating animation image-to-video generation. AnimationBench operationalizes the Twelve Basic Principles of Animation and IP Preservation into measurable evaluation dimensions, together with Broader Quality Dimensions including semantic consistency, motion rationality, and camera motion consistency. The benchmark supports both a standardized close-set evaluation for reproducible comparison and a flexible open-set evaluation for diagnostic analysis, and leverages visual-language models for scalable assessment. Extensive experiments show that AnimationBench aligns well with human judgment and exposes animation-specific quality differences overlooked by realism-oriented benchmarks, leading to more informative and discriminative evaluation of state-of-the-art I2V models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes