CL · Dec 17, 2025

From NLG Evaluation to Modern Student Assessment in the Era of ChatGPT: The Great Misalignment Problem and Pedagogical Multi-Factor Assessment (P-MFA)

arXiv:2512.15183v1 · 1 citation · h-index: 5
Originality: Synthesis-oriented
AI Analysis

This paper addresses the assessment challenges that AI tools like ChatGPT create in education, but its contribution is incremental: it adapts existing evaluation concepts to a new context.

The paper tackles the misalignment between traditional student assessment and modern AI tool usage by proposing the Pedagogical Multi-Factor Assessment (P-MFA) model, a process-based framework to improve validity in grading.

This paper explores the growing epistemic parallel between NLG evaluation and the grading of students at a Finnish university. We argue that both domains are experiencing a Great Misalignment Problem. As students increasingly use tools like ChatGPT to produce sophisticated outputs, traditional assessment methods that focus on final products rather than learning processes have lost their validity. To address this, we introduce the Pedagogical Multi-Factor Assessment (P-MFA) model, a process-based, multi-evidence framework inspired by the logic of multi-factor authentication.
