CLAISCJan 27, 2022

Reasoning Like Program Executors

arXiv:2201.11473v2307 citations
AI Analysis

This work addresses the long-standing challenge of improving reasoning in language models for natural language processing tasks, offering a novel approach that could influence future research.

The authors tackled the problem of inadequate reasoning in language models by introducing POET, a pre-training paradigm that uses programs and their execution results to enhance reasoning capabilities. Experimental results on six benchmarks showed significant performance improvements in numerical, logical, and multi-hop reasoning.

Reasoning over natural language is a long-standing goal for the research community. However, studies have shown that existing language models are inadequate in reasoning. To address the issue, we present POET, a novel reasoning pre-training paradigm. Through pre-training language models with programs and their execution results, POET empowers language models to harvest the reasoning knowledge possessed by program executors via a data-driven approach. POET is conceptually simple and can be instantiated by different kinds of program executors. In this paper, we showcase two simple instances POET-Math and POET-Logic, in addition to a complex instance, POET-SQL. Experimental results on six benchmarks demonstrate that POET can significantly boost model performance in natural language reasoning, such as numerical reasoning, logical reasoning, and multi-hop reasoning. POET opens a new gate on reasoning-enhancement pre-training, and we hope our analysis would shed light on the future research of reasoning like program executors.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes