CLAIIRMay 24, 2025

Towards an automatic method for generating topical vocabulary test forms for specific reading passages

arXiv:2505.18762v1h-index: 29
Originality Synthesis-oriented
AI Analysis

This is an incremental method for middle and high school English native speakers to predict text comprehension based on vocabulary knowledge.

The paper tackles the problem of automatically measuring students' background knowledge for specific reading passages by developing K-tool, a system that generates topical vocabulary tests, with an initial evaluation presented.

Background knowledge is typically needed for successful comprehension of topical and domain specific reading passages, such as in the STEM domain. However, there are few automated measures of student knowledge that can be readily deployed and scored in time to make predictions on whether a given student will likely be able to understand a specific content area text. In this paper, we present our effort in developing K-tool, an automated system for generating topical vocabulary tests that measure students' background knowledge related to a specific text. The system automatically detects the topic of a given text and produces topical vocabulary items based on their relationship with the topic. This information is used to automatically generate background knowledge forms that contain words that are highly related to the topic and words that share similar features but do not share high associations to the topic. Prior research indicates that performance on such tasks can help determine whether a student is likely to understand a particular text based on their knowledge state. The described system is intended for use with middle and high school student population of native speakers of English. It is designed to handle single reading passages and is not dependent on any corpus or text collection. In this paper, we describe the system architecture and present an initial evaluation of the system outputs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes