---
license: mit
base_model: zai-org/GLM-4.5V
base_model_relation: quantized
quantized_by: turboderp
tags:
- exl3
---

EXL3 quants of [GLM-4.5V](https://huggingface.co/zai-org/GLM-4.5V)

⚠️ Requires ExLlamaV3 v0.0.15 (or the `dev` branch of v0.0.14)

Base bitrates:

[2.00 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/2.00bpw)    
[3.00 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/3.00bpw)    
[4.00 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/4.00bpw)    

Optimized:

[2.13 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/2.13bpw)    
[2.32 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/2.32bpw)    
[2.55 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/2.55bpw)    
[2.80 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/2.80bpw)    
[3.07 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/3.07bpw)    
[3.49 bits per weight](https://huggingface.co/turboderp/GLM-4.5V-exl3/tree/3.49bpw)
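
Each bitrate above lives on its own branch of this repo, so a single quant can be fetched by passing the branch name as the revision. The following is a minimal sketch, assuming the `huggingface_hub` package is installed; the helper names `branch_for` and `download_quant` and the target directory are hypothetical and not part of this repo or of ExLlamaV3.

```python
REPO_ID = "turboderp/GLM-4.5V-exl3"

# Bitrates listed in this README; each one corresponds to a branch of the repo.
AVAILABLE_BPW = ("2.00", "2.13", "2.32", "2.55", "2.80", "3.00", "3.07", "3.49", "4.00")


def branch_for(bpw: float) -> str:
    """Map a bitrate such as 3.0 to its branch name, e.g. '3.00bpw'."""
    key = f"{bpw:.2f}"
    if key not in AVAILABLE_BPW:
        raise ValueError(f"no {key}bpw branch; choose from {AVAILABLE_BPW}")
    return f"{key}bpw"


def download_quant(bpw: float, local_dir: str) -> str:
    """Download one quant branch and return the local path (hypothetical helper)."""
    # Imported lazily so branch_for() works even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=branch_for(bpw),  # the branch name selects the bitrate
        local_dir=local_dir,
    )
```

For example, `download_quant(3.0, "./GLM-4.5V-exl3-3.00bpw")` would fetch only the 3.00 bpw branch into that directory, rather than every quant in the repo.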