OPERA (KV-cache compression): superseded — cited as a baseline and beaten by newer methods. 1 paper(s) critique it, 0 beat it on benchmarks — #185 of 234 most-superseded. Sub-problem: cluster led by LURE. Newer alternatives in the same sub-problem include PruneHal.

Superseded baseline#185 of 234 most-superseded

OPERA

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

KV-cache compression · first seen Nov 29, 2023

superseded — cited as a baseline and beaten by newer methods

1 papers critique it · 0 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites OPERA as a baseline.

“However, all above mentioned works introduce additional computational overhead, thus slows down model inference speed.”
— PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.

PruneHal PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning
Oct 22, 2025