CLAIMay 8, 2024

MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning

arXiv:2405.05189v226 citationsh-index: 2ACL
Originality Incremental advance
AI Analysis

This addresses the problem of unreliable structured reasoning in AI for tasks like argument extraction and semantic graph generation, though it is incremental as it builds on self-consistency methods.

The paper tackles error propagation and missing elements in generating reasoning graphs from natural language using LLMs by proposing MIDGARD, which uses Minimum Description Length to aggregate multiple graph samples, achieving superior performance across structured reasoning tasks.

We study the task of conducting structured reasoning as generating a reasoning graph from natural language input using large language models (LLMs). Previous approaches have explored various prompting schemes, yet they suffer from error propagation due to the autoregressive nature and single-pass-based decoding, which lack error correction capability. Additionally, relying solely on a single sample may result in the omission of true nodes and edges. To counter this, we draw inspiration from self-consistency (SC), which involves sampling a diverse set of reasoning chains and taking the majority vote as the final answer. To tackle the substantial challenge of applying SC on generated graphs, we propose MIDGARD (MInimum Description length Guided Aggregation of Reasoning in Directed acyclic graph) that leverages Minimum Description Length (MDL)-based formulation to identify consistent properties among the different graph samples generated by an LLM. This formulation helps reject properties that appear in only a few samples, which are likely to be erroneous, while enabling the inclusion of missing elements without compromising precision. Our method demonstrates superior performance than comparisons across various structured reasoning tasks, including argument structure extraction, explanation graph generation, inferring dependency relations among actions for everyday tasks, and semantic graph generation from natural texts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes