CLMay 12, 2023

Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table

arXiv:2305.07288v1243 citations
Originality Synthesis-oriented
AI Analysis

This provides a benchmark for researchers working on open domain question answering with complex table reasoning, though it is incremental as it adapts existing datasets.

The authors tackled the lack of datasets for open domain question answering over tables that require complex reasoning, and released Open-WikiTable, the first such dataset built upon WikiSQL and WikiTableQuestions with 100,000+ question-answer pairs.

Despite recent interest in open domain question answering (ODQA) over tables, many studies still rely on datasets that are not truly optimal for the task with respect to utilizing structural nature of table. These datasets assume answers reside as a single cell value and do not necessitate exploring over multiple cells such as aggregation, comparison, and sorting. Thus, we release Open-WikiTable, the first ODQA dataset that requires complex reasoning over tables. Open-WikiTable is built upon WikiSQL and WikiTableQuestions to be applicable in the open-domain setting. As each question is coupled with both textual answers and SQL queries, Open-WikiTable opens up a wide range of possibilities for future research, as both reader and parser methods can be applied. The dataset and code are publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes