Is ScoutAttention superseded?

No. ScoutAttention (KV-cache compression) is on the current frontier: it is recent and not yet superseded in the knowledge base. No papers critique it and none beat it on benchmarks, so it is not ranked as a superseded baseline. It belongs to the sub-problem cluster led by Quest. Newer methods in the same sub-problem include ParisKV, KVDrive, Louver, and IceCache.

ScoutAttention

ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference

KV-cache compression · first seen Mar 28, 2026

current frontier — recent, not yet superseded in the knowledge base

0 papers critique it · 0 beat it on benchmarks

Newer alternatives

Recent methods in the same sub-problem, newest first, none yet superseded in the knowledge base. The list includes ScoutAttention itself for reference.

ParisKV · ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs · May 28, 2026
KVDrive · KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference · May 18, 2026
Louver · Sparse Attention as a Range Searching Problem: Towards an Inference-Efficient Index for KV Cache · May 7, 2026
IceCache · IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs · Apr 12, 2026
ScoutAttention · ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference · Mar 28, 2026
DynSplit-KV · DynSplit-KV: Dynamic Semantic Splitting for KVCache Compression in Efficient Long-Context LLM Inference · Feb 3, 2026
HeteroCache · HeteroCache: A Dynamic Retrieval Approach to Heterogeneous KV Cache Compression for Long-Context LLM Inference · Jan 20, 2026
CXL-SpecKV · CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving · Dec 11, 2025
CLO · CLO: Efficient LLM Inference System with CPU-Light KVCache Offloading via Algorithm-System Co-Design · Nov 18, 2025
LouisKV · LouisKV: Efficient KV Cache Retrieval for Long Input-Output Sequences · Oct 13, 2025