turboderp commited on
Commit
0d88db8
·
verified ·
1 Parent(s): c9c3a97

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ EXL3 quants of [Qwen3-Next-80B-A3B-Thinking](https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking)
6
+
7
+ ⚠️ Requires ExLlamaV3 v0.0.7 (or v0.0.6 `dev` branch)
8
+
9
+ Base bitrates:
10
+
11
+ [2.00 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/2.0bpw)
12
+ [3.00 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/3.0bpw)
13
+ [4.00 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/4.0bpw)
14
+ [5.00 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/5.0bpw)
15
+ [6.00 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/6.0bpw)
16
+
17
+ Optimized:
18
+
19
+ [2.08 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/2.08bpw)
20
+ [2.27 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/2.27bpw)
21
+ [2.78 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/2.78bpw)
22
+ [3.14 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/3.14bpw)
23
+ [3.53 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/3.53bpw)
24
+ [4.06 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/4.06bpw)
25
+ [4.51 bits per weight](https://huggingface.co/turboderp/Qwen3-Next-80B-A3B-Thinking-exl3/tree/4.51bpw)
26
+
27
+ ![kld](https://cdn-uploads.huggingface.co/production/uploads/6383dc174c48969dcf1b4fce/1PyK_7p9vbDKy6VWGjjbw.png)