Knowledge-Aware Diverse Reranking for Cross-Source Question Answering
This work addresses the challenge of retrieving diverse and relevant documents from large corpora for question answering, though it appears incremental because it builds on existing RAG methods.
The authors tackled the problem of retrieving question-relevant documents in cross-source question answering by developing a knowledge-aware diverse reranking RAG pipeline, which achieved first place in the SIGIR 2025 LiveRAG competition.
This paper presents Team Marikarp's solution for the SIGIR 2025 LiveRAG competition. The competition's evaluation set, automatically generated by DataMorgana from internet corpora, encompassed a wide range of target topics, question types, question formulations, audience types, and knowledge organization methods. It offered a fair evaluation of the retrieval of question-relevant supporting documents from a 15M-document subset of the FineWeb corpus. Our proposed knowledge-aware diverse reranking RAG pipeline achieved first place in the competition.