CLIRDec 23, 2024

RAGONITE: Iterative Retrieval on Induced Databases and Verbalized RDF for Conversational QA over KGs with RAG

arXiv:2412.17690v32 citationsh-index: 22BTW
Originality Incremental advance
AI Analysis

This addresses the brittleness of SPARQL for complex intents in conversational QA, offering a domain-specific solution for knowledge graph search.

The paper tackles the problem of conversational question answering over RDF knowledge graphs by proposing a system that fuses SQL-query results from a derived database and text-search results over verbalized KG facts, with iterative retrieval and RAG, achieving superiority over baselines on a BMW automobile KG.

Conversational question answering (ConvQA) is a convenient means of searching over RDF knowledge graphs (KGs), where a prevalent approach is to translate natural language questions to SPARQL queries. However, SPARQL has certain shortcomings: (i) it is brittle for complex intents and conversational questions, and (ii) it is not suitable for more abstract needs. Instead, we propose a novel two-pronged system where we fuse: (i) SQL-query results over a database automatically derived from the KG, and (ii) text-search results over verbalizations of KG facts. Our pipeline supports iterative retrieval: when the results of any branch are found to be unsatisfactory, the system can automatically opt for further rounds. We put everything together in a retrieval augmented generation (RAG) setup, where an LLM generates a coherent response from accumulated search results. We demonstrate the superiority of our proposed system over several baselines on a knowledge graph of BMW automobiles.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes