Method Drift›KV-cache compression
Superseded baseline#195 of 234 most-superseded
Prompted self-correction
KV-cache compression
superseded — cited as a baseline and beaten by newer methods
1 papers critique it · 0 beat it on benchmarks
What papers say
Verbatim critique sentences, each from a paper that cites Prompted self-correction as a baseline.
“our experiments show this baseline achieves only 19.8% on MATH-500 with Llama-3-8B, lower than standard AR (28.8%), consistent with Huang2024, who show that LLMs cannot reliably self-correct without external feedback”
— Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.