Is uniform bit-width quantization superseded?

Question

Accepted Answer

uniform bit-width quantization (Mixture-of-experts routing): superseded — cited as a baseline and beaten by newer methods. 2 paper(s) critique it, 0 beat it on benchmarks — #241 of 1370 most-superseded. Sub-problem: cluster led by MoEQuant. Newer alternatives in the same sub-problem include BitsMoE, GEMQ, KBVQ-MoE, MC# (Mixture-Compressor-sharp).

What papers say

Newer alternatives