AIHCSep 24, 2024

Beyond Text-to-Text: An Overview of Multimodal and Generative Artificial Intelligence for Education Using Topic Modeling

arXiv:2409.16376v220 citationsh-index: 8
Originality Synthesis-oriented
AI Analysis

This identifies a need for more balanced research across AI modalities to better leverage multimodal technologies for educational transformation, though it is incremental as it maps existing trends rather than proposing new solutions.

The study used topic modeling to analyze 4175 articles on generative AI in education, finding a predominant focus on text-to-text models like ChatGPT while other modalities such as text-to-speech and text-to-image are underexplored, highlighting a research gap.

Generative artificial intelligence (GenAI) can reshape education and learning. While large language models (LLMs) like ChatGPT dominate current educational research, multimodal capabilities, such as text-to-speech and text-to-image, are less explored. This study uses topic modeling to map the research landscape of multimodal and generative AI in education. An extensive literature search using Dimensions yielded 4175 articles. Employing a topic modeling approach, latent topics were extracted, resulting in 38 interpretable topics organized into 14 thematic areas. Findings indicate a predominant focus on text-to-text models in educational contexts, with other modalities underexplored, overlooking the broader potential of multimodal approaches. The results suggest a research gap, stressing the importance of more balanced attention across different AI modalities and educational levels. In summary, this research provides an overview of current trends in generative AI for education, underlining opportunities for future exploration of multimodal technologies to fully realize the transformative potential of artificial intelligence in education.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes