ML LGMar 10, 2019

$β^3$-IRT: A New Item Response Model and its Applications

Yu Chen, Telmo Silva Filho, Ricardo B. C. Prudêncio, Tom Diethe, Peter Flach

arXiv:1903.04016v37.736 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for more flexible assessment models in educational testing and machine learning evaluation, though it appears incremental as it builds upon existing IRT frameworks.

The paper introduces the $β^3$-IRT model, which extends Item Response Theory to handle continuous responses and a richer family of Item Characteristic Curves, and demonstrates that it outperforms the standard 2PL-ND model on online exam datasets.

Item Response Theory (IRT) aims to assess latent abilities of respondents based on the correctness of their answers in aptitude test items with different difficulty levels. In this paper, we propose the $β^3$-IRT model, which models continuous responses and can generate a much enriched family of Item Characteristic Curve (ICC). In experiments we applied the proposed model to data from an online exam platform, and show our model outperforms a more standard 2PL-ND model on all datasets. Furthermore, we show how to apply $β^3$-IRT to assess the ability of machine learning classifiers. This novel application results in a new metric for evaluating the quality of the classifier's probability estimates, based on the inferred difficulty and discrimination of data instances.

View on arXiv PDF Code

Similar