
wenhua cheng

wenhuach21

AI & ML interests

Model Compression, CV

Recent Activity

updated a model 4 days ago
Intel/Qwen3-8B-GGUF-Q2KS-AS-AutoRound
published a model 4 days ago
Intel/Qwen3-8B-GGUF-Q2KS-AS-AutoRound
reacted to their post with 🚀 4 days ago
AutoRound keeps evolving its LLM quantization algorithm! 🚀 After enhancing W2A16 quantization, we now offer a fast algorithm to generate mixed bits/data-type schemes (~2 mins for 8B models), great for MXFP4 and W2A16. Learn more: https://github.com/intel/auto-round/blob/main/docs/step_by_step.md#autoscheme
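
For context on what the post announces, here is a minimal, hypothetical sketch of driving AutoRound's mixed-bit scheme search from Python. The class and parameter names used below (AutoScheme, avg_bits, options, quantize_and_save) are assumptions loosely based on the linked step_by_step guide, not a verified API; consult the repository for the exact interface.

```python
# Hypothetical sketch only: AutoScheme, avg_bits, options and quantize_and_save
# are assumed names; see the linked step_by_step guide for the real interface.
from auto_round import AutoRound, AutoScheme

model_name = "Qwen/Qwen3-8B"  # roughly the 8B size the post quotes for the ~2 min search

# Search for a per-layer mix of candidate schemes that meets a target average
# bit-width (~2 bits here, matching the W2A16 / MXFP4 use cases in the post).
scheme = AutoScheme(avg_bits=2.0, options=("W2A16", "W4A16"))

ar = AutoRound(model_name, scheme=scheme)
ar.quantize_and_save("./Qwen3-8B-mixed-autoround")
```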

Organizations

Intel, Need4Speed, Qwen

authored 2 papers about 2 years ago

TEQ: Trainable Equivalent Transformation for Quantization of LLMs

Paper • arXiv:2310.10944 • Published Oct 17, 2023 • 10 upvotes

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Paper • arXiv:2309.05516 • Published Sep 11, 2023 • 10 upvotes
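
The second paper's title names the core technique: learn a bounded rounding offset per weight and update it with the sign of the gradient. As a rough, unofficial illustration of that idea (the function and variable names below are invented for this sketch and do not come from the paper's code):

```python
# Toy illustration of signed-gradient-descent weight rounding (not the authors' code).
import torch

def fake_quant(weight, scale, v, bits=4):
    # Uniform quantization with a learnable rounding offset v; keeping v in
    # [-0.5, 0.5] means each weight can only flip to a neighboring grid point.
    qmax = 2 ** (bits - 1) - 1
    q = torch.clamp(torch.round(weight / scale + v), -qmax - 1, qmax)
    return q * scale

def signed_sgd_step(v, grad, lr=1e-3):
    # Update the rounding offsets using only the sign of the gradient,
    # then clamp them back into the valid rounding range.
    return torch.clamp(v - lr * torch.sign(grad), -0.5, 0.5)
```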