EXL3 quants of nanochat-d34

⚠️ Requires ExLlamaV3 v0.0.19 (or v0.0.18 dev branch)

Base bitrates:

2.00 bits per weight
3.00 bits per weight
4.00 bits per weight
5.00 bits per weight
6.00 bits per weight

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for turboderp/nanochat-d34-exl3

Quantized
(1)
this model

Collection including turboderp/nanochat-d34-exl3