APEX4

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

LLM quantization · first seen Jun 7, 2026

current frontier — recent, not yet superseded in the knowledge base

0 papers critique it · 0 beat it on benchmarks

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.