CLAINov 4, 2024

QCG-Rerank: Chunks Graph Rerank with Query Expansion in Retrieval-Augmented LLMs for Tourism Domain

arXiv:2411.08724v14 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses retrieval challenges in tourism-specific RAG systems, but it is incremental as it builds on existing RAG methods with domain-specific optimizations.

The paper tackles the problem of irrelevant or contradictory information in Retrieval-Augmented Generation (RAG) for the tourism domain, where queries are brief and database content is diverse, by proposing QCG-Rerank, which uses query expansion and a chunks graph to improve retrieval, and demonstrates effectiveness on multiple datasets.

Retrieval-Augmented Generation (RAG) mitigates the issue of hallucination in Large Language Models (LLMs) by integrating information retrieval techniques. However, in the tourism domain, since the query is usually brief and the content in the database is diverse, existing RAG may contain a significant amount of irrelevant or contradictory information contents after retrieval. To address this challenge, we propose the QCG-Rerank model. This model first performs an initial retrieval to obtain candidate chunks and then enhances semantics by extracting critical information to expand the original query. Next, we utilize the expanded query and candidate chunks to calculate similarity scores as the initial transition probability and construct the chunks graph. Subsequently, We iteratively compute the transition probabilities based on an initial estimate until convergence. The chunks with the highest score are selected and input into the LLMs to generate responses. We evaluate the model on Cultour, IIRC, StrategyQA, HotpotQA, SQuAD, and MuSiQue datasets. The experimental results demonstrate the effectiveness and superiority of the QCG-Rerank method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes