huawei-csl/Qwen3-235B-A22B-3bit-SINQ
Text Generation
•
Updated
None defined yet.
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding