ProcData: An R Package for Process Data Analysis
This work addresses the need for better educational assessment tools by offering a software package for researchers and practitioners, but it is incremental as it builds on existing methods like neural networks and feature extraction.
The authors tackled the challenge of analyzing educational assessment process data by developing an R package called ProcData, which provides tools for processing, describing, and analyzing timestamped action sequences, including feature extraction and neural-network-based sequence modeling.
Process data refer to data recorded in the log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents' response processes of solving the items. Process data analysis aims at enhancing educational assessment accuracy and serving other assessment purposes by utilizing the rich information contained in response processes. The R package ProcData presented in this article is designed to provide tools for processing, describing, and analyzing process data. We define an S3 class "proc" for organizing process data and extend generic methods summary and print for class "proc". Two feature extraction methods for process data are implemented in the package for compressing information in the irregular response processes into regular numeric vectors. ProcData also provides functions for fitting and making predictions from a neural-network-based sequence model. These functions call relevant functions in package keras for constructing and training neural networks. In addition, several response process generators and a real dataset of response processes of the climate control item in the 2012 Programme for International Student Assessment are included in the package.