CVDec 10, 2024

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

arXiv:2412.07375v312 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of inconsistent character customization in AI-generated story visuals for applications in entertainment and education, representing an incremental advance.

The paper tackled the challenge of balancing character identity preservation and text-semantics alignment in story visualization by proposing StoryWeaver, a method that uses a Character Graph and knowledge-enhanced spatial guidance, resulting in improved performance with metrics like +9.03% DINO-I and +13.44% CLIP-T.

Story visualization has gained increasing attention in artificial intelligence. However, existing methods still struggle with maintaining a balance between character identity preservation and text-semantics alignment, largely due to a lack of detailed semantic modeling of the story scene. To tackle this challenge, we propose a novel knowledge graph, namely Character Graph (\textbf{CG}), which comprehensively represents various story-related knowledge, including the characters, the attributes related to characters, and the relationship between characters. We then introduce StoryWeaver, an image generator that achieve Customization via Character Graph (\textbf{C-CG}), capable of consistent story visualization with rich text semantics. To further improve the multi-character generation performance, we incorporate knowledge-enhanced spatial guidance (\textbf{KE-SG}) into StoryWeaver to precisely inject character semantics into generation. To validate the effectiveness of our proposed method, extensive experiments are conducted using a new benchmark called TBC-Bench. The experiments confirm that our StoryWeaver excels not only in creating vivid visual story plots but also in accurately conveying character identities across various scenarios with considerable storage efficiency, \emph{e.g.}, achieving an average increase of +9.03\% DINO-I and +13.44\% CLIP-T. Furthermore, ablation experiments are conducted to verify the superiority of the proposed module. Codes and datasets are released at https://github.com/Aria-Zhangjl/StoryWeaver.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes