AI CL MASep 2, 2025

How Real Is AI Tutoring? Comparing Simulated and Human Dialogues in One-on-One Instruction

Ruijia Li, Yuan-Hao Jiang, Jiatong Wang, Bo Jiang

arXiv:2509.01914v19.63 citationsh-index: 5

Originality Incremental advance

AI Analysis

This addresses the problem of AI limitations in educational tutoring for students and educators, providing empirical guidance for improvement, though it is incremental as it builds on existing analysis methods.

The study compared AI-simulated and human tutoring dialogues, finding that human dialogues were significantly better in utterance length, questioning, and feedback behaviors, with human interactions being more cognitively guided and diverse, while AI dialogues showed structural simplification and convergence.

Heuristic and scaffolded teacher-student dialogues are widely regarded as critical for fostering students' higher-order thinking and deep learning. However, large language models (LLMs) currently face challenges in generating pedagogically rich interactions. This study systematically investigates the structural and behavioral differences between AI-simulated and authentic human tutoring dialogues. We conducted a quantitative comparison using an Initiation-Response-Feedback (IRF) coding scheme and Epistemic Network Analysis (ENA). The results show that human dialogues are significantly superior to their AI counterparts in utterance length, as well as in questioning (I-Q) and general feedback (F-F) behaviors. More importantly, ENA results reveal a fundamental divergence in interactional patterns: human dialogues are more cognitively guided and diverse, centered around a "question-factual response-feedback" teaching loop that clearly reflects pedagogical guidance and student-driven thinking; in contrast, simulated dialogues exhibit a pattern of structural simplification and behavioral convergence, revolving around an "explanation-simplistic response" loop that is essentially a simple information transfer between the teacher and student. These findings illuminate key limitations in current AI-generated tutoring and provide empirical guidance for designing and evaluating more pedagogically effective generative educational dialogue systems.

View on arXiv PDF

Similar