CLJul 10, 2019

ReQA: An Evaluation for End-to-End Answer Retrieval Models

arXiv:1907.04780v21037 citations
AI Analysis

This provides a benchmark for researchers working on scalable end-to-end answer retrieval, though it is incremental as it builds on existing QA tasks.

The paper tackles the problem of evaluating large-scale answer retrieval models by introducing the ReQA benchmark, establishing baselines with neural and classical methods to assess sentence-level retrieval performance.

Popular QA benchmarks like SQuAD have driven progress on the task of identifying answer spans within a specific passage, with models now surpassing human performance. However, retrieving relevant answers from a huge corpus of documents is still a challenging problem, and places different requirements on the model architecture. There is growing interest in developing scalable answer retrieval models trained end-to-end, bypassing the typical document retrieval step. In this paper, we introduce Retrieval Question-Answering (ReQA), a benchmark for evaluating large-scale sentence-level answer retrieval models. We establish baselines using both neural encoding models as well as classical information retrieval techniques. We release our evaluation code to encourage further work on this challenging task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes