CLDec 16, 2024

Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation

Leonidas Zotos, Hedderik van Rijn, Malvina Nissim

arXiv:2412.11831v24.26 citationsh-index: 26EDM

Originality Incremental advance

AI Analysis

This provides a cost-effective tool for teachers and students to assess question difficulty in educational settings, though it is incremental as it builds on existing methods with a novel feature.

The authors tackled automatic estimation of multiple-choice question difficulty by leveraging model uncertainty from Large Language Models, achieving state-of-the-art results on the USMLE and CMCQRD datasets.

In an educational setting, an estimate of the difficulty of multiple-choice questions (MCQs), a commonly used strategy to assess learning progress, constitutes very useful information for both teachers and students. Since human assessment is costly from multiple points of view, automatic approaches to MCQ item difficulty estimation are investigated, yielding however mixed success until now. Our approach to this problem takes a different angle from previous work: asking various Large Language Models to tackle the questions included in three different MCQ datasets, we leverage model uncertainty to estimate item difficulty. By using both model uncertainty features as well as textual features in a Random Forest regressor, we show that uncertainty features contribute substantially to difficulty prediction, where difficulty is inversely proportional to the number of students who can correctly answer a question. In addition to showing the value of our approach, we also observe that our model achieves state-of-the-art results on the USMLE and CMCQRD publicly available datasets.

View on arXiv PDF

Similar