Method DriftSpeculative decoding

Heavily superseded#4 of 151 most-superseded

Medusa

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Speculative decoding · first seen Jan 19, 2024

heavily superseded — a standard baseline that newer methods routinely beat

19 papers critique it · 9 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites Medusa as a baseline.

Beaten on benchmarks

Head-to-head results where a newer method reports beating Medusa. Values are copied from the source paper's tables — verify against the cited paper.

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.