CLApr 2, 2024

Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models

Wanyong Feng, Jaewook Lee, Hunter McNichols, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan

arXiv:2404.02124v318.136 citationsh-index: 10Has CodeNAACL-HLT

Originality Synthesis-oriented

AI Analysis

This work addresses the scalability issue for teachers and content designers in creating math assessments, but it is incremental as it highlights limitations of existing LLM approaches.

The study tackled the problem of automating distractor generation for math multiple-choice questions using large language models, finding that while LLMs can produce mathematically valid distractors, they are less effective at anticipating common student errors or misconceptions.

Multiple-choice questions (MCQs) are ubiquitous in almost all levels of education since they are easy to administer, grade, and are a reliable format in assessments and practices. One of the most important aspects of MCQs is the distractors, i.e., incorrect options that are designed to target common errors or misconceptions among real students. To date, the task of crafting high-quality distractors largely remains a labor and time-intensive process for teachers and learning content designers, which has limited scalability. In this work, we study the task of automated distractor generation in the domain of math MCQs and explore a wide variety of large language model (LLM)-based approaches, from in-context learning to fine-tuning. We conduct extensive experiments using a real-world math MCQ dataset and find that although LLMs can generate some mathematically valid distractors, they are less adept at anticipating common errors or misconceptions among real students.

View on arXiv PDF Code

Similar