-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
dousery/medical-reasoning-gpt-oss-20b
Text Generation
•
21B
•
Updated
•
2.22k
•
43
mlx-community/MiniMax-M2-4bit
Text Generation
•
229B
•
Updated
•
164
•
5
mlx-community/GLM-4.6-4bit
Text Generation
•
353B
•
Updated
•
4.01k
•
11
mlx-community/chandra-4bit
Image-to-Text
•
Updated
•
122
•
4
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
•
7B
•
Updated
•
100k
•
126
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
134k
•
39
nightmedia/VCoder-120b-1.0-qx86-hi-mlx
Text Generation
•
117B
•
Updated
•
90
•
3
Jalea96/DeepSeek-OCR-bnb-4bit-NF4
Image-Text-to-Text
•
3B
•
Updated
•
584
•
3
TheBloke/MythoMax-L2-13B-GPTQ
Text Generation
•
2B
•
Updated
•
870
•
215
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
522k
•
113
Qwen/Qwen3-235B-A22B-GPTQ-Int4
Text Generation
•
Updated
•
54.1k
•
25
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
•
Updated
•
5.46k
•
7
Kavyaah/medical-coding-llm
4B
•
Updated
•
108
•
5
unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
9B
•
Updated
•
20.4k
•
7
Edison2525/Qwen3-8B-AWQ
8B
•
Updated
•
66
•
2
manasmisra/GLM-4.5-Air-REAP-82B-A12B-mlx-4Bit
Text Generation
•
82B
•
Updated
•
407
•
2
unsloth/Qwen3-VL-2B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
2B
•
Updated
•
1.08k
•
2
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
984
•
2
mlx-community/LLaDA2.0-flash-preview-4bit
Text Generation
•
103B
•
Updated
•
27
•
2
TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ
Text Generation
•
2B
•
Updated
•
308
•
320
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
4B
•
Updated
•
585
•
586
TheBloke/Phind-CodeLlama-34B-v2-GPTQ
Text Generation
•
5B
•
Updated
•
30
•
90
TheBloke/leo-hessianai-13B-chat-AWQ
Text Generation
•
2B
•
Updated
•
36
•
1
TheBloke/Psyfighter-13B-GPTQ
Text Generation
•
2B
•
Updated
•
15
•
7
TheBloke/Mistral-7B-Instruct-v0.2-AWQ
Text Generation
•
1B
•
Updated
•
55.8k
•
51
MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF
Text Generation
•
7B
•
Updated
•
312
•
10
macadeliccc/Hermes-2-Pro-Mistral-7B-AWQ
Text Generation
•
1B
•
Updated
•
4
•
1
RichardErkhov/akdeniz27_-_roberta-base-cuad-4bits
Text Generation
•
83.6M
•
Updated
•
1
unsloth/mistral-7b-instruct-v0.3-bnb-4bit
Text Generation
•
4B
•
Updated
•
52.8k
•
33
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4
Text Generation
•
59B
•
Updated
•
544
•
37