MM, Dec 1, 2021

Multimodal AI Companion for Interactive Fairytale Co-creation

arXiv:2112.00331v1
AI Analysis

This work addresses the need for more engaging educational tools for early childhood development, though it appears incremental, building on existing multimodal AI methods.

The paper tackles the problem of limited interaction in AI fairy tale systems for kids by proposing AI.R Taletorium, a multimodal companion that enables co-creation through sketching and text, resulting in meaningful and vivid fairy tales generated with limited training data and completing full interaction cycles under various inputs.

AI fairy tale companions play an important role in early childhood education as an augmentation of parents' efforts to close the participation gap and boost kids' mental and language development. Existing systems are generally designed to provide vivid materials as unidirectional, entertaining reading environments, e.g., by visualizing input text. However, because kids have a limited vocabulary, these systems fail to afford effective interaction that motivates kids to write their own fairy tales. In this work, we propose AI.R Taletorium, an illustrative, immersive, and inclusive multimodal AI companion for interactive fairy tale co-creation that actively involves kids in creating fairy tales with both the AI agent and their normal peers. AI.R Taletorium consists of a neural story generator and a doodler-based fairy tale visualizer. We design a character-centric bidirectional connection mechanism between the story generator and the visualizer, equipped with Contrastive Language-Image Pre-training (CLIP), thus enabling kids to participate in the story generation process by simple sketching. Extensive experiments and user studies show that our system is able to generate and visualize meaningful and vivid fairy tales with limited training data and to complete the full interaction cycle under various inputs (text, doodler) through the bidirectional connection.
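The abstract does not spell out how the CLIP-based connection maps a sketch to a story character. As a rough illustration only, CLIP-style models place images and text in a shared embedding space, so a sketch can be matched to the candidate character whose text embedding has the highest cosine similarity. The sketch below assumes that setup; the function names, the three-dimensional toy vectors, and the character list are hypothetical stand-ins for real encoder outputs and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def match_character(sketch_embedding, character_embeddings):
    """Return the character name closest to the sketch, plus all scores.

    Hypothetical helper: a real system would obtain both kinds of
    embeddings from a CLIP image encoder and text encoder.
    """
    scores = {
        name: cosine(sketch_embedding, embedding)
        for name, embedding in character_embeddings.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy stand-ins for text embeddings of candidate story characters (assumed values).
character_embeddings = {
    "dragon": [0.9, 0.1, 0.2],
    "princess": [0.1, 0.8, 0.3],
    "rabbit": [0.2, 0.3, 0.9],
}

# Toy stand-in for the image embedding of a child's doodle (assumed values).
sketch_embedding = [0.8, 0.2, 0.1]

best, scores = match_character(sketch_embedding, character_embeddings)
print(best)  # prints "dragon"
```

In a design like this, the matched character name could then be passed to the story generator as the next protagonist, which is one plausible reading of the "character-centric" connection; the paper itself should be consulted for the actual mechanism.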

