AIApr 29

Student Guides Teacher: Weak-to-Strong Inference via Spectral Orthogonal Exploration

arXiv:2601.0616071.41 citationsh-index: 3
AI Analysis

Addresses reasoning collapse in LLMs for mathematical reasoning, with preliminary evidence on logic and code tasks.

LLMs suffer from 'Reasoning Collapse' on math tasks, where sampling yields lexical variations of the same error. The proposed Spectral Orthogonal Exploration (SOE) improves accuracy by 62.4% and sampling efficiency by 113.7% over baselines.

Large Language Models (LLMs) often suffer from ''Reasoning Collapse'' on challenging mathematical reasoning tasks, where stochastic sampling produces lexical variations of the same erroneous logic rather than genuine semantic exploration. We observe that failed reasoning traces are often associated with a low-rank bias manifold in the model's hidden-state geometry, which reduces exploration toward corrective solution directions. To address this, we propose Spectral Orthogonal Exploration (SOE), a geometric inference framework under a ''Student Guides Teacher'' paradigm. Instead of using a weak auxiliary agent for imitation, SOE uses it as an orthogonal probe to introduce semantically heterogeneous reasoning signals into the teacher's orthogonal complement of its dominant subspace. This intervention steers the teacher toward more diverse reasoning trajectories and improves exploration beyond standard sampling. Experiments on mathematical benchmarks show that SOE improves average accuracy by 62.4\% and average sampling efficiency by 113.7\% over baseline methods, suggesting that geometric interventions can be effective for mitigating reasoning collapse in mathematical reasoning. We further provide preliminary evidence that SOE is also effective on logic and code generation benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes