Superseded baseline#61 of 80 most-superseded
QSVD
QSVD: Efficient Low-rank Approximation for Unified Query-Key-Value Weight Compression in Low-Precision Vision-Language ModelsLLM quantization · first seen Oct 18, 2025
superseded — cited as a baseline and beaten by newer methods
0 papers critique it · 1 beat it on benchmarks
Beaten on benchmarks
Head-to-head results where a newer method reports beating QSVD. Values are copied from the source paper's tables — verify against the cited paper.
- Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models
SplitQ (Ours) beats QSVD · Avg [W4A8 (7B)]
64.1 vs 58.3
- Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models
SplitQ (Ours) beats QSVD · Avg [W4A4 (7B)]
62.9 vs 55.5
- Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models
SplitQ (Ours) beats QSVD · Avg [W4A8 (13B)]
66.9 vs 66.2
- Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models
SplitQ (Ours) beats QSVD · Avg [W4A4 (13B)]
66.4 vs 63.9
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.
- May 19, 2026
- May 18, 2026
- Quantization-aware Integrated Gradients (QIG)Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated GradientsMar 18, 2026
- SPEED-QSPEED-Q: Staged Processing with Enhanced Distillation towards Efficient Low-bit On-device VLM QuantizationNov 12, 2025
- Quant-dLLMQuant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion Large Language ModelsSep 27, 2025