AICLMASep 2, 2025

How Real Is AI Tutoring? Comparing Simulated and Human Dialogues in One-on-One Instruction

arXiv:2509.01914v13 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the problem of AI limitations in educational tutoring for students and educators, providing empirical guidance for improvement, though it is incremental as it builds on existing analysis methods.

The study compared AI-simulated and human tutoring dialogues, finding that human dialogues were significantly better in utterance length, questioning, and feedback behaviors, with human interactions being more cognitively guided and diverse, while AI dialogues showed structural simplification and convergence.

Heuristic and scaffolded teacher-student dialogues are widely regarded as critical for fostering students' higher-order thinking and deep learning. However, large language models (LLMs) currently face challenges in generating pedagogically rich interactions. This study systematically investigates the structural and behavioral differences between AI-simulated and authentic human tutoring dialogues. We conducted a quantitative comparison using an Initiation-Response-Feedback (IRF) coding scheme and Epistemic Network Analysis (ENA). The results show that human dialogues are significantly superior to their AI counterparts in utterance length, as well as in questioning (I-Q) and general feedback (F-F) behaviors. More importantly, ENA results reveal a fundamental divergence in interactional patterns: human dialogues are more cognitively guided and diverse, centered around a "question-factual response-feedback" teaching loop that clearly reflects pedagogical guidance and student-driven thinking; in contrast, simulated dialogues exhibit a pattern of structural simplification and behavioral convergence, revolving around an "explanation-simplistic response" loop that is essentially a simple information transfer between the teacher and student. These findings illuminate key limitations in current AI-generated tutoring and provide empirical guidance for designing and evaluating more pedagogically effective generative educational dialogue systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes