`ik_llama.cpp` imatrix Quantizations of Qwen/Qwen3-235B-A22B

This quant collection REQUIRES ik_llama.cpp fork to support advanced non-linear SotA quants. Do not download these big files and expect them to run on mainline vanilla llama.cpp, ollama, LM Studio, KoboldCpp, etc!

These quants provide best in class quality for the given memory footprint.

Big Thanks

Shout out to @ubergarm for his diligent work on ik_llama.cpp oriented quanting.

Downloads last month: 1

GGUF

Model size

235B params

Architecture

qwen3moe

Hardware compatibility

6-bit

Model tree for ArtusDev/Qwen3-235B-A22B-GGUF

Base model

Qwen/Qwen3-235B-A22B

Quantized

(43)

this model

ik_llama.cpp imatrix Quantizations of Qwen/Qwen3-235B-A22B

Big Thanks

Model tree for ArtusDev/Qwen3-235B-A22B-GGUF

`ik_llama.cpp` imatrix Quantizations of Qwen/Qwen3-235B-A22B