Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models • Paper • 2504.04823 • Published Apr 7
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact • Paper • 2403.01241 • Published Mar 2, 2024