CYAICLMar 18

Explainability and Certification of AI-Generated Educational Assessments

arXiv:2604.0962239.1h-index: 10
Predicted impact top 56% in CY · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the problem of institutional acceptance for AI-generated assessments in education, though it appears incremental by building on existing taxonomies and methods.

The authors tackled the lack of transparency and certification in AI-generated educational assessments by proposing a framework that combines explainability techniques and a certification workflow, demonstrating feasibility with a study on 500 AI-generated questions that improved transparency and reduced instructor workload.

The rapid adoption of generative artificial intelligence (AI) in educational assessment has created new opportunities for scalable item creation, personalized feedback, and efficient formative evaluation. However, despite advances in taxonomy alignment and automated question generation, the absence of transparent, explainable, and certifiable mechanisms limits institutional and accreditation-level acceptance. This chapter proposes a comprehensive framework for explainability and certification of AI-generated assessment items, combining self-rationalization, attribution-based analysis, and post-hoc verification to produce interpretable cognitive-alignment evidence grounded in Bloom's and SOLO taxonomies. A structured certification metadata schema is introduced to capture provenance, alignment predictions, reviewer actions, and ethical indicators, enabling audit-ready documentation consistent with emerging governance requirements. A traffic-light certification workflow operationalizes these signals by distinguishing auto-certifiable items from those requiring human review or rejection. A proof-of-concept study on 500 AI-generated computer science questions demonstrates the framework's feasibility, showing improved transparency, reduced instructor workload, and enhanced auditability. The chapter concludes by outlining ethical implications, policy considerations, and directions for future research, positioning explainability and certification as essential components of trustworthy, accreditation-ready AI assessment systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes