LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis
This addresses the cold-start problem in intelligent tutoring systems by enabling better handling of new students and exercises, though it appears incremental as it builds on existing CD frameworks with LLM augmentation.
The paper tackles the problem of cognitive diagnosis in education, which traditionally relies on ID-based methods that neglect semantic relationships and struggle with new students/exercises, by proposing LLM4CD that leverages large language models to incorporate open-world knowledge; the method outperforms previous models on multiple real-world datasets.
Cognitive diagnosis (CD) plays a crucial role in intelligent education, evaluating students' comprehension of knowledge concepts based on their test histories. However, current CD methods often model students, exercises, and knowledge concepts solely on their ID relationships, neglecting the abundant semantic relationships present within educational data space. Furthermore, contemporary intelligent tutoring systems (ITS) frequently involve the addition of new students and exercises, a situation that ID-based methods find challenging to manage effectively. The advent of large language models (LLMs) offers the potential for overcoming this challenge with open-world knowledge. In this paper, we propose LLM4CD, which Leverages Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis. Our method utilizes the open-world knowledge of LLMs to construct cognitively expressive textual representations, which are then encoded to introduce rich semantic information into the CD task. Additionally, we propose an innovative bi-level encoder framework that models students' test histories through two levels of encoders: a macro-level cognitive text encoder and a micro-level knowledge state encoder. This approach substitutes traditional ID embeddings with semantic representations, enabling the model to accommodate new students and exercises with open-world knowledge and address the cold-start problem. Extensive experimental results demonstrate that our proposed method consistently outperforms previous CD models on multiple real-world datasets, validating the effectiveness of leveraging LLMs to introduce rich semantic information into the CD task.