CVGRLGAug 13, 2025

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

arXiv:2508.09983v18 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the need for better visual storytelling tools for creators, though it is incremental as it builds on existing diffusion models without architectural changes.

The paper tackled the problem of generating expressive storyboards from natural language by addressing limitations in existing methods that overlook spatial composition and narrative pacing, resulting in a training-free framework that produces more dynamic and coherent storyboards with improved consistency and diversity metrics.

We present Story2Board, a training-free framework for expressive storyboard generation from natural language. Existing methods narrowly focus on subject identity, overlooking key aspects of visual storytelling such as spatial composition, background evolution, and narrative pacing. To address this, we introduce a lightweight consistency framework composed of two components: Latent Panel Anchoring, which preserves a shared character reference across panels, and Reciprocal Attention Value Mixing, which softly blends visual features between token pairs with strong reciprocal attention. Together, these mechanisms enhance coherence without architectural changes or fine-tuning, enabling state-of-the-art diffusion models to generate visually diverse yet consistent storyboards. To structure generation, we use an off-the-shelf language model to convert free-form stories into grounded panel-level prompts. To evaluate, we propose the Rich Storyboard Benchmark, a suite of open-domain narratives designed to assess layout diversity and background-grounded storytelling, in addition to consistency. We also introduce a new Scene Diversity metric that quantifies spatial and pose variation across storyboards. Our qualitative and quantitative results, as well as a user study, show that Story2Board produces more dynamic, coherent, and narratively engaging storyboards than existing baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes