IRAIApr 25, 2022

Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking

arXiv:2204.11673v128 citationsh-index: 50
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in information retrieval for domain-specific applications, representing an incremental improvement over existing PLM-based re-rankers.

The paper tackles vocabulary mismatch and lack of domain knowledge in passage re-ranking by incorporating explicit knowledge from knowledge graphs, achieving improved performance, particularly on queries requiring in-depth domain knowledge.

Passage re-ranking is to obtain a permutation over the candidate passage set from retrieval stage. Re-rankers have been boomed by Pre-trained Language Models (PLMs) due to their overwhelming advantages in natural language understanding. However, existing PLM based re-rankers may easily suffer from vocabulary mismatch and lack of domain specific knowledge. To alleviate these problems, explicit knowledge contained in knowledge graph is carefully introduced in our work. Specifically, we employ the existing knowledge graph which is incomplete and noisy, and first apply it in passage re-ranking task. To leverage a reliable knowledge, we propose a novel knowledge graph distillation method and obtain a knowledge meta graph as the bridge between query and passage. To align both kinds of embedding in the latent space, we employ PLM as text encoder and graph neural network over knowledge meta graph as knowledge encoder. Besides, a novel knowledge injector is designed for the dynamic interaction between text and knowledge encoder. Experimental results demonstrate the effectiveness of our method especially in queries requiring in-depth domain knowledge.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes