Tracked
LRQ
LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling MatricesLLM quantization · first seen Jul 16, 2024
present, but with little supersession signal in the knowledge base
0 papers critique it · 0 beat it on benchmarks