AI CLJul 18, 2024

Thought-Like-Pro: Enhancing Reasoning of Large Language Models through Self-Driven Prolog-based Chain-of-Thought

Xiaoyu Tan, Yongxin Deng, Xihe Qiu, Weidi Xu, Chao Qu, Wei Chu, Yinghui Xu, Yuan Qi

arXiv:2407.14562v214.018 citationsh-index: 12

Originality Incremental advance

AI Analysis

This addresses the problem of inconsistent reasoning performance in LLMs for AI researchers, though it appears incremental as it builds on existing chain-of-thought methods.

The paper tackles the challenge of improving large language models' reasoning by introducing a self-driven framework that uses a Prolog logic engine to generate verified reasoning trajectories, which are then imitated through chain-of-thought prompting, resulting in enhanced reasoning abilities and robust generalization across out-of-distribution tasks.

Large language models (LLMs) have shown exceptional performance as general-purpose assistants, excelling across a variety of reasoning tasks. This achievement represents a significant step toward achieving artificial general intelligence (AGI). Despite these advancements, the effectiveness of LLMs often hinges on the specific prompting strategies employed, and there remains a lack of a robust framework to facilitate learning and generalization across diverse reasoning tasks. To address these challenges, we introduce a novel learning framework, THOUGHT-LIKE-PRO In this framework, we utilize imitation learning to imitate the Chain-of-Thought (CoT) process which is verified and translated from reasoning trajectories generated by a symbolic Prolog logic engine. This framework proceeds in a self-driven manner, that enables LLMs to formulate rules and statements from given instructions and leverage the symbolic Prolog engine to derive results. Subsequently, LLMs convert Prolog-derived successive reasoning trajectories into natural language CoT for imitation learning. Our empirical findings indicate that our proposed approach substantially enhances the reasoning abilities of LLMs and demonstrates robust generalization across out-of-distribution reasoning tasks.

View on arXiv PDF

Similar