CL · Sep 17, 2024

Kahaani: A Multimodal Co-Creative Storytelling System

arXiv:2409.11261v6 · 3 citations · h-index: 16
Originality: Synthesis-oriented
AI Analysis

This is an incremental system for children's education, aiming to improve English skills, teach life lessons, and explain story structure through interactive storytelling.

The paper tackles the challenge of sustaining children's engagement in educational storytelling by introducing Kahaani, a multimodal co-creative system that uses generative AI to produce immersive stories. Each AI component is evaluated separately, and a small user study with three parent-child pairs reports positive feedback.

This paper introduces Kahaani, a multimodal, co-creative storytelling system for children that leverages Generative Artificial Intelligence to address the challenge of sustaining engagement in educational narrative experiences. Here we define co-creative as a collaborative creative process in which both the child and Kahaani contribute to the generation of the story. The system combines Large Language Model (LLM), Text-to-Speech (TTS), Text-to-Music (TTM), and Text-to-Video (TTV) generation to produce a rich, immersive, and accessible storytelling experience. The system grounds the co-creation process in two classical storytelling frameworks: Freytag's Pyramid and Propp's Narrative Functions. The main goals of Kahaani are: (1) to help children improve their English skills, (2) to teach important life lessons through story morals, and (3) to help them understand how stories are structured, all in a fun and engaging way. We present evaluations for each AI component used, along with a user study involving three parent-child pairs to assess the overall experience and educational value of the system.
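To make the co-creative loop concrete, here is a minimal sketch of how a story could advance through the five stages of Freytag's Pyramid, with each beat passed to per-modality renderers. This is not the authors' implementation: the abstract does not describe the system's prompts, models, or orchestration. All names (`co_create`, `StoryBeat`, `stub_writer`, `STUB_RENDERERS`) are hypothetical, the one-stage-per-turn pacing is an assumption, and the stubs stand in for real LLM, TTS, TTM, and TTV calls. Propp's Narrative Functions are left out for brevity.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, Dict, List


class FreytagStage(Enum):
    """The five stages of Freytag's Pyramid, in narrative order."""
    EXPOSITION = "exposition"
    RISING_ACTION = "rising action"
    CLIMAX = "climax"
    FALLING_ACTION = "falling action"
    DENOUEMENT = "denouement"


@dataclass
class StoryBeat:
    stage: FreytagStage
    child_idea: str              # the child's contribution for this turn
    text: str                    # the system's written continuation
    media: Dict[str, str] = field(default_factory=dict)


def co_create(
    child_ideas: List[str],
    write_beat: Callable[[FreytagStage, str, str], str],
    renderers: Dict[str, Callable[[str], str]],
) -> List[StoryBeat]:
    """Pair each child idea with one Freytag stage (assumed pacing)."""
    beats: List[StoryBeat] = []
    for stage, idea in zip(FreytagStage, child_ideas):
        # Give the writer the story so far as context.
        context = " ".join(beat.text for beat in beats)
        text = write_beat(stage, idea, context)
        # Render the same text in every modality.
        media = {name: render(text) for name, render in renderers.items()}
        beats.append(StoryBeat(stage, idea, text, media))
    return beats


# Hypothetical stand-ins; a real system would call generative models here.
def stub_writer(stage: FreytagStage, idea: str, context: str) -> str:
    return f"[{stage.value}] {idea}"


STUB_RENDERERS: Dict[str, Callable[[str], str]] = {
    "speech": lambda text: f"speech for: {text}",
    "music": lambda text: f"music for: {text}",
    "video": lambda text: f"video for: {text}",
}
```

Calling `co_create` with five child ideas, `stub_writer`, and `STUB_RENDERERS` returns one `StoryBeat` per stage, each carrying speech, music, and video placeholders. Keeping the writer and renderers as plain callables means each component can be swapped or evaluated on its own, which mirrors the per-component evaluation the abstract mentions.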

