Update Qwen3-1.7B-Q4_K_M/README.md

Qwen3-1.7B-Q4_K_M/README.md CHANGED

@@ -20,7 +20,7 @@ Quantized version of [Qwen/Qwen3-1.7B](https://huggingface.co/Qwen/Qwen3-1.7B) a
 ## Model Info
 
 - **Format**: GGUF (for llama.cpp and compatible runtimes)
-- **Size**: 1.
+- **Size**: 1.28 GB
 - **Precision**: Q4_K_M
 - **Base Model**: [Qwen/Qwen3-1.7B](https://huggingface.co/Qwen/Qwen3-1.7B)
 - **Conversion Tool**: [llama.cpp](https://github.com/ggerganov/llama.cpp)

@@ -121,7 +121,7 @@ Here’s how you can query this model via API using `curl` and `jq`. Replace the
 
 ```bash
 curl http://localhost:11434/api/generate -s -N -d '{
-  "model": "hf.co/geoffmunn/Qwen3-1.7B:Q4_K_M
+  "model": "hf.co/geoffmunn/Qwen3-1.7B:Q4_K_M",
   "prompt": "Respond exactly as follows: Write a short limerick about a robot who loves gardening.",
   "temperature": 0.8,
   "top_p": 0.95,