CL CY LGJun 27, 2024

DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions

Nigel Fernandez, Alexander Scarlatos, Wanyong Feng, Simon Woodhead, Andrew Lan

arXiv:2406.19356v214.927 citationsHas Code

Originality Highly original

AI Analysis

This addresses the challenge of automating distractor creation for math education, offering an incremental improvement over existing LLM-based methods.

The paper tackled the problem of generating high-quality distractors for math multiple-choice questions by introducing DiVERT, a variational approach that learns interpretable error representations, and showed it outperforms GPT-4o-based methods on a real-world dataset with 1,434 questions.

High-quality distractors are crucial to both the assessment and pedagogical value of multiple-choice questions (MCQs), where manually crafting ones that anticipate knowledge deficiencies or misconceptions among real students is difficult. Meanwhile, automated distractor generation, even with the help of large language models (LLMs), remains challenging for subjects like math. It is crucial to not only identify plausible distractors but also understand the error behind them. In this paper, we introduce DiVERT (Distractor Generation with Variational Errors Represented as Text), a novel variational approach that learns an interpretable representation of errors behind distractors in math MCQs. Through experiments on a real-world math MCQ dataset with 1,434 questions used by hundreds of thousands of students, we show that DiVERT, despite using a base open-source LLM with 7B parameters, outperforms state-of-the-art approaches using GPT-4o on downstream distractor generation. We also conduct a human evaluation with math educators and find that DiVERT leads to error labels that are of comparable quality to human-authored ones.

View on arXiv PDF Code

Similar