CV · Sep 15, 2025

Character-Centric Understanding of Animated Movies

Oxford
arXiv:2509.12204v1 · 1 citation · h-index: 49 · MM
Originality: Incremental advance
AI Analysis

This work addresses the accessibility and narrative comprehension of animated movies for visually impaired and hearing-impaired audiences, a domain-specific advance.

The paper tackles the challenge of recognizing animated characters, whose appearances vary widely. It proposes an audio-visual pipeline that automatically builds a character bank from online sources, achieving robust recognition and enabling applications such as audio description generation and character-aware subtitling.

Animated movies are captivating for their unique character designs and imaginative storytelling, yet they pose significant challenges for existing recognition systems. Unlike the consistent visual patterns detected by conventional face recognition methods, animated characters exhibit extreme diversity in their appearance, motion, and deformation. In this work, we propose an audio-visual pipeline to enable automatic and robust animated character recognition, and thereby enhance character-centric understanding of animated movies. Central to our approach is the automatic construction of an audio-visual character bank from online sources. This bank contains both visual exemplars and voice (audio) samples for each character, enabling subsequent multi-modal character recognition despite long-tailed appearance distributions. Building on accurate character recognition, we explore two downstream applications: Audio Description (AD) generation for visually impaired audiences, and character-aware subtitling for the hearing impaired. To support research in this domain, we introduce CMD-AM, a new dataset of 75 animated movies with comprehensive annotations. Our character-centric pipeline demonstrates significant improvements in both accessibility and narrative comprehension for animated content over prior face-detection-based approaches. For the code and dataset, visit https://www.robots.ox.ac.uk/~vgg/research/animated_ad/.
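To make the idea of an audio-visual character bank concrete, the following is a minimal sketch, not the authors' implementation. It assumes each character is stored with a few visual and voice embedding vectors, and that a query is matched by cosine similarity to the closest exemplar in each modality, with the two scores averaged by a weight. The names `CharacterEntry`, `recognise`, and `visual_weight`, the toy 2-D vectors, and the fusion rule are all hypothetical; the abstract does not specify how the modalities are combined.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class CharacterEntry:
    """Hypothetical bank entry: several exemplars per modality for one character."""
    name: str
    visual: list = field(default_factory=list)  # list of visual embedding vectors
    voice: list = field(default_factory=list)   # list of voice embedding vectors


def best_similarity(query, exemplars):
    """Similarity to the closest exemplar, or None if the modality is unavailable."""
    if query is None or not exemplars:
        return None
    return max(cosine(query, e) for e in exemplars)


def recognise(bank, visual_query=None, voice_query=None, visual_weight=0.5):
    """Return (name, score) of the best-matching character.

    Assumed fusion rule: weighted average of per-modality scores when both
    are present, otherwise fall back to whichever modality is available.
    """
    best_name, best_score = None, float("-inf")
    for entry in bank:
        v = best_similarity(visual_query, entry.visual)
        a = best_similarity(voice_query, entry.voice)
        if v is None and a is None:
            continue
        if v is None:
            score = a
        elif a is None:
            score = v
        else:
            score = visual_weight * v + (1.0 - visual_weight) * a
        if score > best_score:
            best_name, best_score = entry.name, score
    return best_name, best_score


# Toy bank with made-up 2-D embeddings (real embeddings would come from
# pretrained visual and speaker encoders).
bank = [
    CharacterEntry("character_a", visual=[[1.0, 0.0], [0.8, 0.6]], voice=[[0.0, 1.0]]),
    CharacterEntry("character_b", visual=[[0.0, 1.0]], voice=[[1.0, 0.0]]),
]

print(recognise(bank, visual_query=[1.0, 0.0], voice_query=[0.0, 1.0]))
```

Storing several exemplars per character and taking the maximum similarity is one simple way to cope with a character whose appearance varies widely across scenes; falling back to a single modality covers off-screen speech or silent on-screen appearances.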
