AICLLGAug 31, 2018

Multi-Hop Knowledge Graph Reasoning with Reward Shaping

arXiv:1808.10568v21215 citations
AI Analysis

This work addresses challenges in query answering for incomplete knowledge graphs, offering incremental improvements to enhance generalization and robustness in multi-hop reasoning tasks.

The paper tackled the problem of multi-hop reasoning over incomplete knowledge graphs by addressing low-quality rewards and spurious search trajectories in reinforcement learning setups, resulting in significant improvements over existing path-based models and competitive performance with embedding-based models on benchmark datasets.

Multi-hop reasoning is an effective approach for query answering (QA) over incomplete knowledge graphs (KGs). The problem can be formulated in a reinforcement learning (RL) setup, where a policy-based agent sequentially extends its inference path until it reaches a target. However, in an incomplete KG environment, the agent receives low-quality rewards corrupted by false negatives in the training data, which harms generalization at test time. Furthermore, since no golden action sequence is used for training, the agent can be misled by spurious search trajectories that incidentally lead to the correct answer. We propose two modeling advances to address both issues: (1) we reduce the impact of false negative supervision by adopting a pretrained one-hop embedding model to estimate the reward of unobserved facts; (2) we counter the sensitivity to spurious paths of on-policy RL by forcing the agent to explore a diverse set of paths using randomly generated edge masks. Our approach significantly improves over existing path-based KGQA models on several benchmark datasets and is comparable or better than embedding-based models.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes