CL IRDec 23, 2024

RAGONITE: Iterative Retrieval on Induced Databases and Verbalized RDF for Conversational QA over KGs with RAG

Rishiraj Saha Roy, Chris Hinze, Joel Schlotthauer, Farzad Naderi, Viktor Hangya, Andreas Foltyn, Luzian Hahn, Fabian Kuech

arXiv:2412.17690v31.92 citationsh-index: 22BTW

Originality Incremental advance

AI Analysis

This addresses the brittleness of SPARQL for complex intents in conversational QA, offering a domain-specific solution for knowledge graph search.

The paper tackles the problem of conversational question answering over RDF knowledge graphs by proposing a system that fuses SQL-query results from a derived database and text-search results over verbalized KG facts, with iterative retrieval and RAG, achieving superiority over baselines on a BMW automobile KG.

Conversational question answering (ConvQA) is a convenient means of searching over RDF knowledge graphs (KGs), where a prevalent approach is to translate natural language questions to SPARQL queries. However, SPARQL has certain shortcomings: (i) it is brittle for complex intents and conversational questions, and (ii) it is not suitable for more abstract needs. Instead, we propose a novel two-pronged system where we fuse: (i) SQL-query results over a database automatically derived from the KG, and (ii) text-search results over verbalizations of KG facts. Our pipeline supports iterative retrieval: when the results of any branch are found to be unsatisfactory, the system can automatically opt for further rounds. We put everything together in a retrieval augmented generation (RAG) setup, where an LLM generates a coherent response from accumulated search results. We demonstrate the superiority of our proposed system over several baselines on a knowledge graph of BMW automobiles.

View on arXiv PDF

Similar