Generating Multiple-Choice Knowledge Questions with Interpretable Difficulty Estimation using Knowledge Graphs and Large Language Models

Mehmet Can Şakiroğlu, H. Altay Güvenir, Kamer Kaya

arXiv:2604.107483.7h-index: 2

AI Analysis

This work addresses the need for automated MCQ generation with accurate difficulty estimation in adaptive education, offering an interpretable alternative to black-box models.

The study proposes a method to generate multiple-choice questions (MCQs) with interpretable difficulty estimation by combining knowledge graphs (KGs) and large language models (LLMs). The approach constructs a KG from input documents, generates MCQs from graph components, and computes nine difficulty signals to produce a unified score that aligns with human perceptions.

Generating multiple-choice questions (MCQs) with difficulty estimation remains challenging in automated MCQ-generation systems used in adaptive, AI-assisted education. This study proposes a novel methodology for generating MCQs with difficulty estimation from the input documents by utilizing knowledge graphs (KGs) and large language models (LLMs). Our approach uses an LLM to construct a KG from input documents, from which MCQs are then systematically generated. Each MCQ is generated by selecting a node from the KG as the key, sampling a related triple or quintuple -- optionally augmented with an extra triple -- and prompting an LLM to generate a corresponding stem from these graph components. Distractors are then selected from the KG. For each MCQ, nine difficulty signals are computed and combined into a unified difficulty score using a data-driven approach. Experimental results demonstrate that our method generates high-quality MCQs whose difficulty estimation is interpretable and aligns with human perceptions. Our approach improves automated MCQ generation by integrating structured knowledge representations with LLMs and a data-driven difficulty estimation model.

View on arXiv PDF

Similar