CLAIJul 17, 2025

QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation

arXiv:2507.13266v330 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of enhancing reasoning capabilities in LLMs for math tasks, representing an incremental improvement over existing methods.

The paper tackles the challenge of improving reasoning capacity in large language models via reinforcement learning by proposing Question Augmentation, which introduces partial solutions during training to reduce problem difficulty and provide better learning signals, achieving new state-of-the-art results on math benchmarks with gains of up to +12.79%.

Reinforcement learning (RL) has emerged as a central paradigm for training large language models (LLMs) in reasoning tasks. Yet recent studies question RL's ability to incentivize reasoning capacity beyond the base model. This raises a key challenge: how can RL be adapted to solve harder reasoning problems more effectively? To address this challenge, we propose a simple yet effective strategy via Question Augmentation: introduce partial solutions during training to reduce problem difficulty and provide more informative learning signals. Our method, QuestA, when applied during RL training on math reasoning tasks, not only improves pass@1 but also pass@k-particularly on problems where standard RL struggles to make progress. This enables continual improvement over strong open-source models such as DeepScaleR and OpenMath Nemotron, further enhancing their reasoning capabilities. We achieve new state-of-the-art results on math benchmarks using 1.5B-parameter models: 72.50% (+10.73%) on AIME24, 62.29% (+12.79%) on AIME25, and 41.67% (+10.11%) on HMMT25. Code, data and model are available at https://github.com/foreverlasting1202/QuestA.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes