AICLJan 31, 2025

Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies

arXiv:2501.18817v1h-index: 2
Originality Incremental advance
AI Analysis

This addresses the accessibility and cost issues in AI reasoning for users with limited resources, though it is incremental as it builds on existing LLM capabilities.

The paper tackles the high cost of large language models (LLMs) for reasoning tasks by proposing methods to enhance smaller LLMs, achieving performance comparable to larger models at a fraction of the cost, with a 30% average cost reduction in experiments.

Recent advancements in the reasoning skills of Large Language Models (LLMs) demonstrate an increase in the ability of LLMs to solve simple planning tasks. However, as long as the driving force behind improved reasoning capability is the size and complexity of the model, the financial and computational costs associated with running them will also increase. This trend raises questions about continued accessibility and whether these improvements will increase at the same pace as models continue to grow in size and expense. We propose two approaches to enhance the reasoning ability of less resource-intensive LLMs. (1) Provide them with a generalised strategy for solving tasks within a given domain, generated by a more resource-intensive LLM. (2) Exploit their cost-effectiveness by iteratively prompting these models to correct errors in their proposed solutions. Our empirical results from planning and mathematical reasoning tasks demonstrate that these methods improve the performance of less resource-intensive LLMs to levels comparable with their more resource-intensive counterparts, at a fraction of the cost. Additionally, we show that the utilisation of generalised strategies in our experiments reduced the cost of the less resource-intensive model by nearly 30 percent on average.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes