CLMay 26, 2023

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models

arXiv:2305.16582v213.351 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the problem of improving reasoning accuracy in language models for NLP and multimodal tasks, representing an incremental advance over existing methods.

The paper tackles the limitation of Chain-of-Thought reasoning in language models by proposing Graph-of-Thought reasoning to model non-linear human thought processes, achieving a 2.4% accuracy improvement on AQUA-RAT and boosting accuracy from 85.19% to 87.59% on ScienceQA.

With the widespread use of language models (LMs) in NLP tasks, researchers have discovered the potential of Chain-of-thought (CoT) to assist LMs in accomplishing complex reasoning tasks by generating intermediate steps. However, human thought processes are often non-linear, rather than simply sequential chains of thoughts. Therefore, we propose Graph-of-Thought (GoT) reasoning, which models human thought processes not only as a chain but also as a graph. By representing thought units as nodes and connections between them as edges, our approach captures the non-sequential nature of human thinking and allows for a more realistic modeling of thought processes. GoT adopts a two-stage framework with an additional GoT encoder for thought graph representation and fuses the graph representation with the original input representation through a gated fusion mechanism. We evaluate GoT's performance on a text-only reasoning task (AQUA-RAT) and a multimodal reasoning task (ScienceQA). Our model achieves significant improvement over the strong CoT baseline on the AQUA-RAT test set and boosts accuracy from 85.19% to 87.59% using the T5-base model over the state-of-the-art Multimodal-CoT on the ScienceQA test set.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes