CLAIMay 21, 2024

Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction

arXiv:2405.12656v12 citationsh-index: 1Has Code
Originality Incremental advance
AI Analysis

This addresses the need for truthful, low-cost extrapolation in specialized domains, though it appears incremental as it builds on existing retrieval-augmented methods.

The paper tackles the problem of hallucination and high training costs in large language models for open-ended inquiry by proposing a retrieval-augmented framework for extreme multi-label knowledge graph link prediction, demonstrating that using textual data and tailored strategies improves performance with a small model size.

Extrapolation in Large language models (LLMs) for open-ended inquiry encounters two pivotal issues: (1) hallucination and (2) expensive training costs. These issues present challenges for LLMs in specialized domains and personalized data, requiring truthful responses and low fine-tuning costs. Existing works attempt to tackle the problem by augmenting the input of a smaller language model with information from a knowledge graph (KG). However, they have two limitations: (1) failing to extract relevant information from a large one-hop neighborhood in KG and (2) applying the same augmentation strategy for KGs with different characteristics that may result in low performance. Moreover, open-ended inquiry typically yields multiple responses, further complicating extrapolation. We propose a new task, the extreme multi-label KG link prediction task, to enable a model to perform extrapolation with multiple responses using structured real-world knowledge. Our retriever identifies relevant one-hop neighbors by considering entity, relation, and textual data together. Our experiments demonstrate that (1) KGs with different characteristics require different augmenting strategies, and (2) augmenting the language model's input with textual data improves task performance significantly. By incorporating the retrieval-augmented framework with KG, our framework, with a small parameter size, is able to extrapolate based on a given KG. The code can be obtained on GitHub: https://github.com/exiled1143/Retrieval-Augmented-Language-Model-for-Multi-Label-Knowledge-Graph-Link-Prediction.git

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes