CLDec 3, 2025

AITutor-EvalKit: Exploring the Capabilities of AI Tutors

Numaan Naeem, Kaushal Kumar Maurya, Kseniia Petukhova, Ekaterina Kochmar

arXiv:2512.03688v12.7h-index: 9

Originality Synthesis-oriented

AI Analysis

This tool addresses the need for better evaluation methods in AI-driven education for stakeholders and the ACL community, though it appears incremental as it builds on existing language technology without introducing a new paradigm.

The authors tackled the problem of evaluating the pedagogical quality of AI tutors by developing AITutor-EvalKit, a tool that uses language technology to assess AI tutors and includes features for demonstration, evaluation, model inspection, and data visualization.

We present AITutor-EvalKit, an application that uses language technology to evaluate the pedagogical quality of AI tutors, provides software for demonstration and evaluation, as well as model inspection and data visualization. This tool is aimed at education stakeholders as well as *ACL community at large, as it supports learning and can also be used to collect user feedback and annotations.

View on arXiv PDF

Similar