LGOct 18, 2024

Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning

arXiv:2410.19000v17 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of enhancing zero-shot reasoning in LLMs for complex question-answering tasks, representing an incremental improvement over existing methods like Chain-of-Thought and ReAct.

The paper tackles the limitation of current zero-shot reasoning methods in complex multi-step tasks by introducing a structure-oriented analysis method and a multi-agent system called SARA, which improves reasoning accuracy and robustness, sometimes surpassing few-shot methods in experiments.

Zero-shot reasoning methods with Large Language Models (LLMs) offer significant advantages including great generalization to novel tasks and reduced dependency on human-crafted examples. However, the current zero-shot methods still have limitations in complex tasks, e.g., answering questions that require multi-step reasoning. In this paper, we address this limitation by introducing a novel structure-oriented analysis method to help LLMs better understand the question and guide the problem-solving process of LLMs. We first demonstrate how the existing reasoning strategies, Chain-of-Thought and ReAct, can benefit from our structure-oriented analysis. In addition to empirical investigations, we leverage the probabilistic graphical model to theoretically explain why our structure-oriented analysis can improve the LLM reasoning process. To further improve the reliability in complex question-answering tasks, we propose a multi-agent reasoning system, Structure-oriented Autonomous Reasoning Agents (SARA), that can better enforce the reasoning process following our structure-oriented analysis by refinement techniques and is equipped with external knowledge retrieval capability to reduce factual errors. Extensive experiments verify the effectiveness of the proposed reasoning system. Surprisingly, in some cases, the system even surpasses few-shot methods. Finally, the system not only improves reasoning accuracy in complex tasks but also demonstrates robustness against potential attacks that corrupt the reasoning process.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes