CLJan 26, 2023

Causal Reasoning of Entities and Events in Procedural Texts

AI2CMU
arXiv:2301.10896v3281 citationsh-index: 68
Originality Incremental advance
AI Analysis

This addresses a gap in natural language processing for procedural understanding, though it is incremental as it builds on existing prompting techniques.

The paper tackles the problem of causal reasoning about entities and events in procedural texts, where existing methods focus on either entity state tracking or event reasoning separately. They introduce CREPE, a new benchmark, and show that language models perform poorly (0.35 F1) compared to humans (0.87 F1), but boost performance to 0.67 F1 using code-like prompting with chain-of-thought reasoning.

Entities and events are crucial to natural language reasoning and common in procedural texts. Existing work has focused either exclusively on entity state tracking (e.g., whether a pan is hot) or on event reasoning (e.g., whether one would burn themselves by touching the pan), while these two tasks are often causally related. We propose CREPE, the first benchmark on causal reasoning of event plausibility and entity states. We show that most language models, including GPT-3, perform close to chance at .35 F1, lagging far behind human at .87 F1. We boost model performance to .59 F1 by creatively representing events as programming languages while prompting language models pretrained on code. By injecting the causal relations between entities and events as intermediate reasoning steps in our representation, we further boost the performance to .67 F1. Our findings indicate not only the challenge that CREPE brings for language models, but also the efficacy of code-like prompting combined with chain-of-thought prompting for multihop event reasoning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes