IRAIMay 16

UniER: A Unified Benchmark for Item-level and Path-level Exercise Recommendation

arXiv:2605.1675015.0Has Code
Predicted impact top 38% in IR · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in educational recommendation systems, this benchmark enables fair comparison between previously isolated paradigms, revealing the superiority of path-level approaches.

The paper introduces UniER, a unified benchmark for item-level and path-level exercise recommendation, using Weighted Cognitive Gain (WCG) as a common metric. Results show path-level methods systematically outperform item-level methods, especially under data sparsity and noise.

Personalized exercise recommendation dynamically aligns pedagogical resources with individual knowledge mastery, which is crucial for satisfying students' dynamic learning needs in modern education. The field is currently driven by two dominant paradigms: Item-Level Exercise Recommendation (ILER) optimizes for immediate single-step state transitions, while Path-Level Exercise Recommendation (PLER) constructs coherent learning paths to maximize cumulative gains. Despite sharing the same ultimate objective, disparate evaluation setups have kept these two lines of research isolated, hindering unified benchmarking and fair comparison. To fill the gap, in this paper, we present a Unified Benchmark for Exercise Recommendation (UniER), a comprehensive evaluation framework that unifies ILER and PLER. Specifically, we introduce Weighted Cognitive Gain (WCG) as a unified metric to measure cross-paradigm algorithmic performance. Our benchmark encompasses 9 datasets spanning four generation methods, facilitating the comparison of 18 representative ILER/PLER methods. Through multi-dimensional analyses covering effectiveness, generalizability, robustness, and efficiency, our results reveal the systematic dominance of PLER and expose the pedagogical failure of ILER's fragmented recommendations under extreme sparsity and noise. Furthermore, we provide an open-source codebase of UniER to foster reproducible research and outline potential directions for future investigations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes