CLDec 21, 2025

MDToC: Metacognitive Dynamic Tree of Concepts for Boosting Mathematical Problem-Solving of Large Language Models

arXiv:2512.18841v1h-index: 10
Originality Incremental advance
AI Analysis

This addresses the challenge of reliable mathematical problem-solving in LLMs, offering incremental but consistent gains over prior methods.

The paper tackles the problem of calculation verification in mathematical reasoning for Large Language Models by introducing MDToC, a three-phase approach that constructs concept trees and uses majority voting. The method achieves improvements of up to 7.6% over existing prompting techniques on benchmarks like CHAMP, MATH, and Game-of-24.

Despite advances in mathematical reasoning capabilities, Large Language Models (LLMs) still struggle with calculation verification when using established prompting techniques. We present MDToC (Metacognitive Dynamic Tree of Concepts), a three-phase approach that constructs a concept tree, develops accuracy-verified calculations for each concept, and employs majority voting to evaluate competing solutions. Evaluations across CHAMP, MATH, and Game-of-24 benchmarks demonstrate our MDToC's effectiveness, with GPT-4-Turbo achieving 58.1\% on CHAMP, 86.6\% on MATH, and 85\% on Game-of-24 - outperforming GoT by 5\%, 5.4\%, and 4\% on all these tasks, respectively, without hand-engineered hints. MDToC consistently surpasses existing prompting methods across all backbone models, yielding improvements of up to 7.6\% over ToT and 6.2\% over GoT, establishing metacognitive calculation verification as a promising direction for enhanced mathematical reasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes