37.6DLMar 27
Interactive Evidence Maps for Visualizing and Understanding Systematic ReviewsAditi Mallavarapu, Rohan Khandare, Mokshagna Kadiyala et al.
Systematic reviews provide comprehensive syntheses of research fields. As a result, systematic reviews often emphasize synthesizing across the large bodies of literature rather than just describing the studies from which the conclusions were drawn. This risks an incomplete description of the sample - encouraging overgeneralization of the findings, obscuring connections between existing work, or overshadowing gaps in the literature. To address this challenge, we introduce interactive evidence maps; an accessible visualization tool that enables researchers to explore, filter, and analyze review data dynamically. Our approach leverages large language models to extract topic models that structure heterogeneous review data into an interactive, explorable knowledge map that supports deeper inspection beyond static tables and figures. We demonstrate the usefulness of interactive evidence maps using data from a published scoping review of pedagogical agents in K-12 education, and compare the results of the evidence map to those reported in the scoping review. Results show that interactive evidence maps complement traditional syntheses by enhancing transparency, supporting exploratory analysis, and revealing patterns and gaps that may not be easy to detect through narrative summaries alone.
CLMay 9, 2025
Tell Me Who Your Students Are: GPT Can Generate Valid Multiple-Choice Questions When Students' (Mis)Understanding Is HintedMachi Shimmei, Masaki Uto, Yuichiroh Matsubayashi et al.
The primary goal of this study is to develop and evaluate an innovative prompting technique, AnaQuest, for generating multiple-choice questions (MCQs) using a pre-trained large language model. In AnaQuest, the choice items are sentence-level assertions about complex concepts. The technique integrates formative and summative assessments. In the formative phase, students answer open-ended questions for target concepts in free text. For summative assessment, AnaQuest analyzes these responses to generate both correct and incorrect assertions. To evaluate the validity of the generated MCQs, Item Response Theory (IRT) was applied to compare item characteristics between MCQs generated by AnaQuest, a baseline ChatGPT prompt, and human-crafted items. An empirical study found that expert instructors rated MCQs generated by both AI models to be as valid as those created by human instructors. However, IRT-based analysis revealed that AnaQuest-generated questions - particularly those with incorrect assertions (foils) - more closely resembled human-crafted items in terms of difficulty and discrimination than those produced by ChatGPT.
HCAug 25, 2025
Beyond prior knowledge: The predictive role of knowledge-building in Tutor LearningTasmia Shahriar, Mia Ameen, Aditi Mallavarapu et al.
When adopting the role of a teacher in learning-by-teaching environments, students often struggle to engage in knowledge-building activities, such as providing explanations and addressing misconceptions. Instead, they frequently default to knowledge-telling behaviors, where they simply dictate what they already know or what to do without deeper reflection, thereby limiting learning. Teachable agents, particularly those capable of posing persistent follow-up questions, have been shown to encourage students (tutors) to shift from knowledge-telling to knowledge-building and enhance tutor learning. Tutor learning encompasses two interrelated types of knowledge: conceptual and procedural knowledge. Research has established a bidirectional relationship between these knowledge types, where improvements in one reinforce the other. This study investigates the role of knowledge-building in mediating the bidirectional relationship between procedural and conceptual learning. Our findings revealed a stable bidirectional relationship between procedural and conceptual knowledge, with higher post-test scores observed among students who engaged in knowledge-building, regardless of their procedural and conceptual pre-test performance. This suggests that knowledge-building serves as a crucial mechanism bridging the gap between students with low prior knowledge and higher conceptual and procedural learning gain.