cpatonn
/

OpenReasoning-Nemotron-7B-AWQ

compressed-tensors

Model card Files Files and versions

cpatonn commited on Jul 19

Commit

2e7b0b5

·

verified ·

1 Parent(s): 1348d1a

Update README.md

Files changed (1) hide show

README.md +16 -3

README.md CHANGED Viewed

@@ -1,3 +1,16 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+base_model:
+- nvidia/OpenReasoning-Nemotron-7B
+datasets:
+- mit-han-lab/pile-val-backup
+---
+# OpenReasoning-Nemotron-7B-AWQ
+## Method
+Quantised using [vllm-project/llm-compressor](https://github.com/vllm-project/llm-compressor.git) and the following configs:
+```
+recipe = [
+    AWQModifier(ignore=["lm_head"], scheme="W4A16_ASYM", targets=["Linear"]),
+]
+```