HCAIDec 15, 2024

Do Tutors Learn from Equity Training and Can Generative AI Assess It?

CMU
arXiv:2412.11255v18 citationsh-index: 11LAK
Originality Synthesis-oriented
AI Analysis

This addresses the lack of scalable tools for teaching and assessing equity skills in education, though it is incremental in applying existing AI methods to a new domain.

The study evaluated whether tutors improved their equity skills after training and tested generative AI models for assessing those skills, finding marginally significant learning gains and that GPT-4o performed well in assessment tasks.

Equity is a core concern of learning analytics. However, applications that teach and assess equity skills, particularly at scale are lacking, often due to barriers in evaluating language. Advances in generative AI via large language models (LLMs) are being used in a wide range of applications, with this present work assessing its use in the equity domain. We evaluate tutor performance within an online lesson on enhancing tutors' skills when responding to students in potentially inequitable situations. We apply a mixed-method approach to analyze the performance of 81 undergraduate remote tutors. We find marginally significant learning gains with increases in tutors' self-reported confidence in their knowledge in responding to middle school students experiencing possible inequities from pretest to posttest. Both GPT-4o and GPT-4-turbo demonstrate proficiency in assessing tutors ability to predict and explain the best approach. Balancing performance, efficiency, and cost, we determine that few-shot learning using GPT-4o is the preferred model. This work makes available a dataset of lesson log data, tutor responses, rubrics for human annotation, and generative AI prompts. Future work involves leveling the difficulty among scenarios and enhancing LLM prompts for large-scale grading and assessment.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes