CLMay 27, 2025

Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams

arXiv:2505.23818v12 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem of scalable and consistent grading for educators and students, though it appears incremental as it builds on existing AI methods for a specific domain.

The paper tackles the challenge of automated answer grading for real-world textual exams by introducing the RATAS framework, which leverages generative AI to achieve high reliability and accuracy with interpretable rationales.

Automated answer grading is a critical challenge in educational technology, with the potential to streamline assessment processes, ensure grading consistency, and provide timely feedback to students. However, existing approaches are often constrained to specific exam formats, lack interpretability in score assignment, and struggle with real-world applicability across diverse subjects and assessment types. To address these limitations, we introduce RATAS (Rubric Automated Tree-based Answer Scoring), a novel framework that leverages state-of-the-art generative AI models for rubric-based grading of textual responses. RATAS is designed to support a wide range of grading rubrics, enable subject-agnostic evaluation, and generate structured, explainable rationales for assigned scores. We formalize the automatic grading task through a mathematical framework tailored to rubric-based assessment and present an architecture capable of handling complex, real-world exam structures. To rigorously evaluate our approach, we construct a unique, contextualized dataset derived from real-world project-based courses, encompassing diverse response formats and varying levels of complexity. Empirical results demonstrate that RATAS achieves high reliability and accuracy in automated grading while providing interpretable feedback that enhances transparency for both students and nstructors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes