Method Drift›KV-cache compression

Superseded baseline#195 of 234 most-superseded

Prompted self-correction

KV-cache compression

superseded — cited as a baseline and beaten by newer methods

1 papers critique it · 0 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites Prompted self-correction as a baseline.

“our experiments show this baseline achieves only 19.8% on MATH-500 with Llama-3-8B, lower than standard AR (28.8%), consistent with Huang2024, who show that LLMs cannot reliably self-correct without external feedback”
— Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.

Latent Phase-Shift Rollback Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering
Apr 20, 2026