Tracked
APEX4
APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute RebalancingLLM quantization · first seen Jun 7, 2026
current frontier — recent, not yet superseded in the knowledge base
0 papers critique it · 0 beat it on benchmarks
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.
- Jun 7, 2026