Update README.md
Browse files
README.md
CHANGED
|
@@ -17,9 +17,11 @@ This is [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) model converted to the
|
|
| 17 |
|
| 18 |
Weight compression was performed using `nncf.compress_weights` with the following parameters:
|
| 19 |
|
| 20 |
-
* mode: **INT4_ASYM**
|
| 21 |
-
* ratio: **0
|
| 22 |
-
* group_size: **128**
|
|
|
|
|
|
|
| 23 |
|
| 24 |
For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2025/openvino-workflow/model-optimization-guide/weight-compression.html).
|
| 25 |
|
|
|
|
| 17 |
|
| 18 |
Weight compression was performed using `nncf.compress_weights` with the following parameters:
|
| 19 |
|
| 20 |
+
* mode: **INT4_ASYM**
|
| 21 |
+
* ratio: **1.0**
|
| 22 |
+
* group_size: **128**
|
| 23 |
+
* scale_estimation: **True**
|
| 24 |
+
* dataset: **wikitext2**
|
| 25 |
|
| 26 |
For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2025/openvino-workflow/model-optimization-guide/weight-compression.html).
|
| 27 |
|