ik_llama.cpp imatrix Quantizations of Qwen/Qwen3-235B-A22B
This quant collection REQUIRES ik_llama.cpp fork to support advanced non-linear SotA quants. Do not download these big files and expect them to run on mainline vanilla llama.cpp, ollama, LM Studio, KoboldCpp, etc!
These quants provide best in class quality for the given memory footprint.
Big Thanks
Shout out to @ubergarm for his diligent work on ik_llama.cpp oriented quanting.
- Downloads last month
- 1
Hardware compatibility
Log In
to view the estimation
6-bit
Model tree for ArtusDev/Qwen3-235B-A22B-GGUF
Base model
Qwen/Qwen3-235B-A22B