CVJun 14, 2025

Retrieval Augmented Comic Image Generation

arXiv:2506.12517v1h-index: 2
Originality Incremental advance
AI Analysis

This addresses the challenge of maintaining character consistency and gesture diversity in comic image generation, which is incremental as it builds on existing image generation methods.

The paper tackles the problem of generating comic-style image sequences with consistent characters and expressive gestures, achieving effective generation of engaging comic narratives with coherent characters and dynamic interactions.

We present RaCig, a novel system for generating comic-style image sequences with consistent characters and expressive gestures. RaCig addresses two key challenges: (1) maintaining character identity and costume consistency across frames, and (2) producing diverse and vivid character gestures. Our approach integrates a retrieval-based character assignment module, which aligns characters in textual prompts with reference images, and a regional character injection mechanism that embeds character features into specified image regions. Experimental results demonstrate that RaCig effectively generates engaging comic narratives with coherent characters and dynamic interactions. The source code will be publicly available to support further research in this area.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes