AI · May 4

SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering

arXiv:2605.02819 · 87.7
Predicted impact: top 24% in AI · last 90 days · Originality: Incremental advance
AI Analysis

For risk-sensitive KGQA tasks (e.g., medical, legal), SCPRM provides more accurate step-wise evaluation, but the improvement is incremental.

SCPRM addresses the risk compensation effect in process reward models for KG reasoning by incorporating schema-aware cumulative rewards, improving Hits@k by 1.18% on average over strong baselines in medical, legal, and CWQ KGQA tasks.

Large language models excel at complex reasoning, yet evaluating their intermediate steps remains challenging. Although process reward models provide step-wise supervision, they often suffer from a risk compensation effect, where incorrect steps are offset by later correct ones, assigning high rewards to flawed reasoning paths. This issue is further exacerbated in knowledge graph (KG) reasoning, as there may exist multiple paths between the start and end entities in the KG, and a single risky step can make the whole reasoning path flawed. These limitations are problematic in risk-sensitive tasks such as medical and legal KG reasoning. To address these issues, we propose a Schema-aware Cumulative Process Reward Model (SCPRM) that evaluates reasoning paths by conditioning on the reasoning prefix and incorporating the schema distance between the current reasoning step and the implicit target parsed from the query, which provides cumulative and future rewards to guide path exploration. We further integrate SCPRM into Monte Carlo Tree Search (MCTS) as SCPRM-MCTS to conduct multi-hop reasoning on KGs for question answering (QA) tasks. Across medical and legal KGQA and CWQ, SCPRM-MCTS improves Hits@k by an average of 1.18% over strong baselines, demonstrating more accurate and risk-sensitive reasoning evaluation.
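
The risk compensation effect described above can be illustrated with a small sketch. It is not the paper's formulation: the step scores are made-up numbers, the function names `mean_reward` and `cumulative_reward` are hypothetical, and a running product is only one plausible way to condition a score on the reasoning prefix. The schema-distance (future reward) term is left out entirely.

```python
def mean_reward(step_scores):
    """Average independently scored steps (a common way to aggregate PRM scores)."""
    return sum(step_scores) / len(step_scores)


def cumulative_reward(step_scores):
    """Score every prefix as the running product of its step scores.

    Assumption for illustration: a product stands in for "conditioning on the
    reasoning prefix". Returns one score per prefix; the last is the path score.
    """
    prefix_scores = []
    running = 1.0
    for score in step_scores:
        running *= score
        prefix_scores.append(running)
    return prefix_scores


# Hypothetical step scores in [0, 1]. The second hop of flawed_path is risky.
flawed_path = [0.95, 0.10, 0.99, 0.99]
sound_path = [0.75, 0.75, 0.75, 0.75]

# Averaging lets the later high scores offset the risky hop.
print(mean_reward(flawed_path) > mean_reward(sound_path))

# The cumulative score keeps the penalty from the risky hop in every later prefix.
print(cumulative_reward(flawed_path)[-1] < cumulative_reward(sound_path)[-1])
```

With these numbers, averaging ranks the flawed path above the sound one, while the cumulative score reverses the ranking because a low-scoring hop cannot be recovered by later steps.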

