CLAILGOct 4, 2022

Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering

arXiv:2210.01959v34 citationsh-index: 10
Originality Incremental advance
AI Analysis

It addresses the laborious task of reading and synthesizing knowledge from technical documents for researchers, though it is incremental as it builds on existing QA methods with a focus on document-level challenges.

The paper tackles the problem of automating information extraction from scholarly documents by proposing a three-stage document-level question answering framework, achieving a +7.19 improvement in Answer-F1 over baselines on the QASPER dataset.

Researchers produce thousands of scholarly documents containing valuable technical knowledge. The community faces the laborious task of reading these documents to identify, extract, and synthesize information. To automate information gathering, document-level question answering (QA) offers a flexible framework where human-posed questions can be adapted to extract diverse knowledge. Finetuning QA systems requires access to labeled data (tuples of context, question and answer). However, data curation for document QA is uniquely challenging because the context (i.e. answer evidence passage) needs to be retrieved from potentially long, ill-formatted documents. Existing QA datasets sidestep this challenge by providing short, well-defined contexts that are unrealistic in real-world applications. We present a three-stage document QA approach: (1) text extraction from PDF; (2) evidence retrieval from extracted texts to form well-posed contexts; (3) QA to extract knowledge from contexts to return high-quality answers -- extractive, abstractive, or Boolean. Using QASPER for evaluation, our detect-retrieve-comprehend (DRC) system achieves a +7.19 improvement in Answer-F1 over existing baselines while delivering superior context selection. Our results demonstrate that DRC holds tremendous promise as a flexible framework for practical scientific document QA.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes