Superseded baseline · #29 of 80 most-superseded

LLM.int8()

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

LLM quantization · first seen Aug 15, 2022

superseded — cited as a baseline and beaten by newer methods

3 papers critique it · 0 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites LLM.int8() as a baseline.