Method Drift›KV-cache compression
Superseded baseline#185 of 234 most-superseded
OPERA
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-AllocationKV-cache compression · first seen Nov 29, 2023
superseded — cited as a baseline and beaten by newer methods
1 papers critique it · 0 beat it on benchmarks
What papers say
Verbatim critique sentences, each from a paper that cites OPERA as a baseline.
“However, all above mentioned works introduce additional computational overhead, thus slows down model inference speed.”
— PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.