CYAICLHCIRFeb 4, 2024

Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation

CMU
arXiv:2402.14594v118 citationsh-index: 86
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of real-time tutor assessment for educational training programs, but it is incremental as it applies existing methods (GPT with RAG) to a new domain.

This study tackled the challenge of automatically assessing novice math tutors' use of social-emotional strategies by evaluating GPT models with different prompting strategies, finding that Retrieval-Augmented Generation (RAG) prompts achieved more accurate performance and lower financial costs compared to other methods.

One-on-one tutoring is an effective instructional method for enhancing learning, yet its efficacy hinges on tutor competencies. Novice math tutors often prioritize content-specific guidance, neglecting aspects such as social-emotional learning. Social-emotional learning promotes equity and inclusion and nurturing relationships with students, which is crucial for holistic student development. Assessing the competencies of tutors accurately and efficiently can drive the development of tailored tutor training programs. However, evaluating novice tutor ability during real-time tutoring remains challenging as it typically requires experts-in-the-loop. To address this challenge, this preliminary study aims to harness Generative Pre-trained Transformers (GPT), such as GPT-3.5 and GPT-4 models, to automatically assess tutors' ability of using social-emotional tutoring strategies. Moreover, this study also reports on the financial dimensions and considerations of employing these models in real-time and at scale for automated assessment. The current study examined four prompting strategies: two basic Zero-shot prompt strategies, Tree of Thought prompt, and Retrieval-Augmented Generator (RAG) based prompt. The results indicate that the RAG prompt demonstrated more accurate performance (assessed by the level of hallucination and correctness in the generated assessment texts) and lower financial costs than the other strategies evaluated. These findings inform the development of personalized tutor training interventions to enhance the the educational effectiveness of tutored learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes