CLFeb 5

Is my model "mind blurting"? Interpreting the dynamics of reasoning tokens with Recurrence Quantification Analysis (RQA)

Quoc Tuan Pham, Mehdi Jafari, Flora Salim

arXiv:2602.06266v10.61 citationsh-index: 1

Originality Incremental advance

AI Analysis

This provides a principled tool for researchers studying test-time compute in reasoning models, though it is incremental as it applies an existing analysis technique to a new context.

The paper tackled the problem of analyzing reasoning behavior in large models by proposing Recurrence Quantification Analysis (RQA) as a non-textual method to interpret token generation dynamics, showing it improves prediction of task complexity by 8%.

Test-time compute is central to large reasoning models, yet analysing their reasoning behaviour through generated text is increasingly impractical and unreliable. Response length is often used as a brute proxy for reasoning effort, but this metric fails to capture the dynamics and effectiveness of the Chain of Thoughts (CoT) or the generated tokens. We propose Recurrence Quantification Analysis (RQA) as a non-textual alternative for analysing model's reasoning chains at test time. By treating token generation as a dynamical system, we extract hidden embeddings at each generation step and apply RQA to the resulting trajectories. RQA metrics, including Determinism and Laminarity, quantify patterns of repetition and stalling in the model's latent representations. Analysing 3,600 generation traces from DeepSeek-R1-Distill, we show that RQA captures signals not reflected by response length, but also substantially improves prediction of task complexity by 8\%. These results help establish RQA as a principled tool for studying the latent token generation dynamics of test-time scaling in reasoning models.

View on arXiv PDF

Similar