CLNov 12, 2023

Are LLMs Rigorous Logical Reasoners? Empowering Natural Language Proof Generation by Stepwise Decoding with Contrastive Learning

arXiv:2311.06736v33 citationsh-index: 27
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing proof planning accuracy in AI, though it appears incremental as it builds on existing multi-stage systems.

The paper tackled the problem of improving logical reasoning in large language models (LLMs) for natural language proof generation by proposing a stepwise decoding approach with contrastive learning to reduce decoding errors, resulting in empirical effectiveness demonstrated in their experiments.

Logical reasoning is a pivotal component in the field of artificial intelligence. Proof planning, particularly in contexts requiring the validation of explanation accuracy, continues to present challenges. The recent advancement of large language models (LLMs) has led to significant progress in natural language proof planning, evolving from one-stage generators to more complex three-stage systems that include additional searchers or verifiers. While these assisted methods improve the quality of generated results, they also introduce increased search efforts and computational costs. Furthermore, the generative process itself remains underexplored. In this study, we propose a stepwise decoding approach augmented by contrastive learning to address two common errors encountered during the LLM generator's decoding process. We fine-tune the language model using both vanilla and enhanced hard negatives to mitigate these decoding errors. Empirical results demonstrate the effectiveness of our strategy. Additionally, our further analysis reveals that even larger LLMs still struggle to generate rigorous logical chains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes