LGAIIRDec 6, 2024

TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG

arXiv:2412.05447v34 citationsh-index: 39EMNLP
Originality Incremental advance
AI Analysis

This addresses retrieval challenges for LLM applications in commercial settings like personal memory organization, though it appears incremental as an enhancement to RAG.

The authors tackled limitations of Retrieval-Augmented Generation (RAG) in commercial applications by proposing TOBUGraph, a graph-based retrieval framework that constructs knowledge graphs from unstructured data and uses graph traversal for retrieval. Their evaluation on real user data showed TOBUGraph outperformed multiple RAG implementations in precision and recall, significantly improving retrieval accuracy.

Retrieval-Augmented Generation (RAG) is one of the leading and most widely used techniques for enhancing LLM retrieval capabilities, but it still faces significant limitations in commercial use cases. RAG primarily relies on the query-chunk text-to-text similarity in the embedding space for retrieval and can fail to capture deeper semantic relationships across chunks, is highly sensitive to chunking strategies, and is prone to hallucinations. To address these challenges, we propose TOBUGraph, a graph-based retrieval framework that first constructs the knowledge graph from unstructured data dynamically and automatically. Using LLMs, TOBUGraph extracts structured knowledge and diverse relationships among data, going beyond RAG's text-to-text similarity. Retrieval is achieved through graph traversal, leveraging the extracted relationships and structures to enhance retrieval accuracy, eliminating the need for chunking configurations while reducing hallucination. We demonstrate TOBUGraph's effectiveness in TOBU, a real-world application in production for personal memory organization and retrieval. Our evaluation using real user data demonstrates that TOBUGraph outperforms multiple RAG implementations in both precision and recall, significantly improving user experience through improved retrieval accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes