OpenVINO
/

Qwen3-8B-int4-ov

Model card Files Files and versions

amokrov commited on Jun 18

Commit

7e1dc3b

·

verified ·

1 Parent(s): 76ab936

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -17,9 +17,11 @@ This is [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) model converted to the
 Weight compression was performed using `nncf.compress_weights` with the following parameters:
-* mode: **INT4_ASYM**
-* ratio: **0.8**
-* group_size: **128**
 For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2025/openvino-workflow/model-optimization-guide/weight-compression.html).

 Weight compression was performed using `nncf.compress_weights` with the following parameters:
+ * mode: **INT4_ASYM**
+ * ratio: **1.0**
+ * group_size: **128**
+ * scale_estimation: **True**
+ * dataset: **wikitext2**
 For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2025/openvino-workflow/model-optimization-guide/weight-compression.html).