CLCYMar 9, 2025

Training LLM-based Tutors to Improve Student Learning Outcomes in Dialogues

arXiv:2503.06424v242 citationsh-index: 10Has CodeAIED
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving personalized tutoring for students using AI, though it is incremental as it builds on existing LLM-based tutoring methods.

The paper tackles the problem of suboptimal AI tutoring dialogues by training an LLM to generate tutor utterances that maximize student correctness while maintaining pedagogical quality, resulting in significantly higher chances of correct student responses.

Generative artificial intelligence (AI) has the potential to scale up personalized tutoring through large language models (LLMs). Recent AI tutors are adapted for the tutoring task by training or prompting LLMs to follow effective pedagogical principles, though they are not trained to maximize student learning throughout the course of a dialogue. Therefore, they may engage with students in a suboptimal way. We address this limitation by introducing an approach to train LLMs to generate tutor utterances that maximize the likelihood of student correctness, while still encouraging the model to follow good pedagogical practice. Specifically, we generate a set of candidate tutor utterances and score them using (1) an LLM-based student model to predict the chance of correct student responses and (2) a pedagogical rubric evaluated by GPT-4o. We then use the resulting data to train an open-source LLM, Llama 3.1 8B, using direct preference optimization. We show that tutor utterances generated by our model lead to significantly higher chances of correct student responses while maintaining the pedagogical quality of GPT-4o. We also conduct qualitative analyses and a human evaluation to demonstrate that our model generates high quality tutor utterances.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes