CLOct 13, 2022

Large Language Models are few(1)-shot Table Reasoners

arXiv:2210.06710v2342 citationsh-index: 34Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of table reasoning for AI researchers by showing LLMs can serve as a simple baseline, though it is incremental as it extends known few-shot capabilities to a new domain.

The paper explored how well large language models (LLMs) perform on table reasoning tasks like QA and fact verification with few-shot learning, finding that they achieve strong performance, even matching some state-of-the-art models with only 1-shot demonstrations and generating better long-form answers than tuned T5-large.

Recent literature has shown that large language models (LLMs) are generally excellent few-shot reasoners to solve text reasoning tasks. However, the capability of LLMs on table reasoning tasks is yet to be explored. In this paper, we aim at understanding how well LLMs can perform table-related tasks with few-shot in-context learning. Specifically, we evaluated LLMs on popular table QA and fact verification datasets like WikiTableQuestion, FetaQA, TabFact, and FEVEROUS and found that LLMs are competent at complex reasoning over table structures, though these models are not pre-trained on any table corpus. When combined with `chain of thoughts' prompting, LLMs can achieve very strong performance with only a 1-shot demonstration, even on par with some SoTA models. We show that LLMs are even more competent at generating comprehensive long-form answers on FetaQA than tuned T5-large. We further manually studied the reasoning chains elicited from LLMs and found that these reasoning chains are highly consistent with the underlying semantic form. We believe that LLMs can serve as a simple yet generic baseline for future research. The code and data are released in https://github.com/wenhuchen/TableCoT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes