CLJan 16, 2022

In Situ Answer Sentence Selection at Web-scale

Zeyu Zhang, Thuy Vu, Alessandro Moschitti

arXiv:2201.05984v10.84 citations

Originality Highly original

AI Analysis

This work addresses the computational bottleneck in deploying question answering services at Web scale, offering a more efficient and accurate solution for large-scale applications.

The paper tackles the inefficiency of answer sentence selection in open-domain question answering by introducing PEASI, a Transformer-based framework that jointly reranks passages and extracts answers in place, achieving a 6.51% accuracy improvement from 48.86% to 55.37% and reducing inference needs by ~80%.

Current answer sentence selection (AS2) applied in open-domain question answering (ODQA) selects answers by ranking a large set of possible candidates, i.e., sentences, extracted from the retrieved text. In this paper, we present Passage-based Extracting Answer Sentence In-place (PEASI), a novel design for AS2 optimized for Web-scale setting, that, instead, computes such answer without processing each candidate individually. Specifically, we design a Transformer-based framework that jointly (i) reranks passages retrieved for a question and (ii) identifies a probable answer from the top passages in place. We train PEASI in a multi-task learning framework that encourages feature sharing between the components: passage reranker and passage-based answer sentence extractor. To facilitate our development, we construct a new Web-sourced large-scale QA dataset consisting of 800,000+ labeled passages/sentences for 60,000+ questions. The experiments show that our proposed design effectively outperforms the current state-of-the-art setting for AS2, i.e., a point-wise model for ranking sentences independently, by 6.51% in accuracy, from 48.86% to 55.37%. In addition, PEASI is exceptionally efficient in computing answer sentences, requiring only ~20% inferences compared to the standard setting, i.e., reranking all possible candidates. We believe the release of PEASI, both the dataset and our proposed design, can contribute to advancing the research and development in deploying question answering services at Web scale.

View on arXiv PDF

Similar