CLMay 27, 2025

LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners

arXiv:2505.21239v11 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of assessing student cognitive states without prior interaction data, enabling more effective personalized learning in AI-empowered education, though it is an incremental advancement over existing NLP-based approaches.

The paper tackles the cold-start problem in cognitive diagnosis for personalized education by proposing LMCD, a framework that uses large language models to generate enriched content and fuse semantic and cognitive information, achieving significant performance improvements over state-of-the-art methods in exercise-cold and domain-cold settings.

Cognitive Diagnosis (CD) has become a critical task in AI-empowered education, supporting personalized learning by accurately assessing students' cognitive states. However, traditional CD models often struggle in cold-start scenarios due to the lack of student-exercise interaction data. Recent NLP-based approaches leveraging pre-trained language models (PLMs) have shown promise by utilizing textual features but fail to fully bridge the gap between semantic understanding and cognitive profiling. In this work, we propose Language Models as Zeroshot Cognitive Diagnosis Learners (LMCD), a novel framework designed to handle cold-start challenges by harnessing large language models (LLMs). LMCD operates via two primary phases: (1) Knowledge Diffusion, where LLMs generate enriched contents of exercises and knowledge concepts (KCs), establishing stronger semantic links; and (2) Semantic-Cognitive Fusion, where LLMs employ causal attention mechanisms to integrate textual information and student cognitive states, creating comprehensive profiles for both students and exercises. These representations are efficiently trained with off-the-shelf CD models. Experiments on two real-world datasets demonstrate that LMCD significantly outperforms state-of-the-art methods in both exercise-cold and domain-cold settings. The code is publicly available at https://github.com/TAL-auroraX/LMCD

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes