Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models
This addresses the challenge of complex and resource-intensive 3D storytelling for creators, though it appears incremental by building on existing LLM and procedural modeling techniques.
The paper tackles the problem of creating comprehensive 3D visualizations from narratives by introducing Story3D-Agent, which uses large language models to transform stories into 3D-rendered scenes with precise control over characters and elements, validated for effectiveness.
Traditional visual storytelling is complex, requiring specialized knowledge and substantial resources, yet often constrained by human creativity and creation precision. While Large Language Models (LLMs) enhance visual storytelling, current approaches often limit themselves to 2D visuals or oversimplify stories through motion synthesis and behavioral simulation, failing to create comprehensive, multi-dimensional narratives. To this end, we present Story3D-Agent, a pioneering approach that leverages the capabilities of LLMs to transform provided narratives into 3D-rendered visualizations. By integrating procedural modeling, our approach enables precise control over multi-character actions and motions, as well as diverse decorative elements, ensuring the long-range and dynamic 3D representation. Furthermore, our method supports narrative extension through logical reasoning, ensuring that generated content remains consistent with existing conditions. We have thoroughly evaluated our Story3D-Agent to validate its effectiveness, offering a basic framework to advance 3D story representation.