LG CY · Jan 27, 2021

On the Interpretability of Deep Learning Based Models for Knowledge Tracing

arXiv:2101.11335v1 · 4.42 citations

Originality: Synthesis-oriented

AI Analysis

This work addresses interpretability issues in deep learning models for knowledge tracing, which is crucial for improving Intelligent Tutoring Systems, but it is incremental as it builds on prior analyses with extended data and discussions.

The paper tackles the problem of interpretability in deep learning-based knowledge tracing models, revealing critical pitfalls in Deep Knowledge Tracing (DKT) such as learning an 'ability' model instead of tracking skills and improper learning of recurrence relations, with findings supported by analyses on a larger dataset and comparisons to untrained networks.

Knowledge tracing allows Intelligent Tutoring Systems to infer which topics or skills a student has mastered, thus adjusting curriculum accordingly. Deep Learning based models like Deep Knowledge Tracing (DKT) and Dynamic Key-Value Memory Network (DKVMN) have achieved significant improvements compared with models like Bayesian Knowledge Tracing (BKT) and Performance Factors Analysis (PFA). However, these deep learning based models are not as interpretable as other models because the decision-making process learned by deep neural networks is not wholly understood by the research community. In previous work, we critically examined the DKT model, visualizing and analyzing the behaviors of DKT in high dimensional space. In this work, we extend our original analyses with a much larger dataset and add discussions about the memory states of the DKVMN model. We discover that Deep Knowledge Tracing has some critical pitfalls: 1) instead of tracking each skill through time, DKT is more likely to learn an 'ability' model; 2) the recurrent nature of DKT reinforces irrelevant information that it uses during the tracking task; 3) an untrained recurrent network can achieve similar results to a trained DKT model, supporting a conclusion that recurrence relations are not properly learned and, instead, improvements are simply a benefit of projection into a high dimensional, sparse vector space. Based on these observations, we propose improvements and future directions for conducting knowledge tracing research using deep neural network models.
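To make the third pitfall more concrete, the following is a minimal sketch of what an untrained recurrent projection could look like. It is not the authors' code. It assumes the one-hot (skill, correctness) input encoding of length 2M commonly used with DKT, and the function names, weight scales, and hidden size are hypothetical choices made for illustration only.

```python
import numpy as np

def encode_interactions(skills, correct, num_skills):
    """One-hot encode (skill, correctness) pairs into vectors of length 2 * num_skills.

    Assumed encoding: index = skill + correct * num_skills.
    """
    skills = np.asarray(skills)
    correct = np.asarray(correct)
    x = np.zeros((len(skills), 2 * num_skills))
    x[np.arange(len(skills)), skills + correct * num_skills] = 1.0
    return x

def random_recurrent_states(x, hidden_size, seed=0):
    """Run the inputs through a frozen, randomly initialised tanh RNN (no training)."""
    rng = np.random.default_rng(seed)
    # Hypothetical weight scales; the weights are never updated.
    w_in = rng.normal(scale=1.0 / np.sqrt(x.shape[1]), size=(x.shape[1], hidden_size))
    w_rec = rng.normal(scale=1.0 / np.sqrt(hidden_size), size=(hidden_size, hidden_size))
    h = np.zeros(hidden_size)
    states = []
    for x_t in x:
        h = np.tanh(x_t @ w_in + h @ w_rec)
        states.append(h)
    return np.stack(states)

# Toy sequence: four interactions over three skills.
skills = [0, 2, 2, 1]
correct = [1, 0, 1, 1]
x = encode_interactions(skills, correct, num_skills=3)
states = random_recurrent_states(x, hidden_size=8)
```

In a comparison of this kind, only a readout layer (for example, a logistic regression on `states`) would be fitted to predict the next response, while the recurrent weights stay at their random initial values. The readout is omitted here to keep the sketch short.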

