CLIRJun 25, 2025

Knowledge-Aware Diverse Reranking for Cross-Source Question Answering

arXiv:2506.20476v11 citationsh-index: 8
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of retrieving diverse and relevant documents from large corpora for question answering, though it appears incremental as it builds on existing RAG methods.

The authors tackled the problem of retrieving question-relevant documents in cross-source question answering by developing a knowledge-aware diverse reranking RAG pipeline, which achieved first place in the SIGIR 2025 LiveRAG competition.

This paper presents Team Marikarp's solution for the SIGIR 2025 LiveRAG competition. The competition's evaluation set, automatically generated by DataMorgana from internet corpora, encompassed a wide range of target topics, question types, question formulations, audience types, and knowledge organization methods. It offered a fair evaluation of retrieving question-relevant supporting documents from a 15M documents subset of the FineWeb corpus. Our proposed knowledge-aware diverse reranking RAG pipeline achieved first place in the competition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes