CLAIAug 29, 2024

Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems

arXiv:2409.00131v11 citationsh-index: 7
Originality Incremental advance
AI Analysis

It addresses the challenge of enhancing mathematical problem-solving for lightweight LLMs, which is an incremental advance in domain-specific reasoning tasks.

This study tackled the problem of improving mathematical reasoning in lightweight Large Language Models by introducing a novel method for measuring logic similarity and using retrieval-enhanced generation with reference problems and prompts, achieving a 15.8% improvement on SVAMP and 21.5% on GSM8K datasets.

This study focuses on improving the performance of lightweight Large Language Models (LLMs) in mathematical reasoning tasks. We introduce a novel method for measuring mathematical logic similarity and design an automatic screening mechanism to construct a set of reference problems that integrate both semantic and logical similarity. By employing carefully crafted positive and negative example prompts, we guide the model towards adopting sound reasoning logic. To the best of our knowledge, this is the first attempt to utilize retrieval-enhanced generation for mathematical problem-solving. Experimental results demonstrate that our method achieves a 15.8% improvement over the Chain of Thought approach on the SVAMP dataset and a 21.5 % improvement on the GSM8K dataset. Further application of this method to a large-scale model with 175 billion parameters yields performance comparable to the best results on both aforementioned datasets. Finally, we conduct an analysis of errors during the reasoning process, providing valuable insights and directions for future research on reasoning tasks using large language models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes