-
-
-
-
-
-
Inference Providers
Active filters:
4bit
legraphista/Llama-3.2-3B-Instruct-IMat-GGUF
Text Generation
•
3B
•
Updated
•
737
•
1
Narrator5000/llavanext-finetuned-stackoverflow-vqa
Updated
•
5
•
1
NeoChen1024/internlm2_5-20b-chat-exl2-4.25bpw-h8
Text Generation
•
Updated
ussipan/SipanGPT-0.1-Llama-3.2-1B-GGUF
Text Generation
•
1B
•
Updated
•
62
•
1
ussipan/SipanGPT-0.2-Llama-3.2-1B-GGUF
Text Generation
•
1B
•
Updated
•
121
mcavus/glm-4v-9b-gptq-4bit-dynamo
3B
•
Updated
•
2
•
1
ussipan/SipanGPT-0.3-Llama-3.2-1B-GGUF
Text Generation
•
1B
•
Updated
•
70
•
1
harishnair04/Gemma-medtr-2b-sft
Text Generation
•
2B
•
Updated
•
2
harishnair04/Gemma-medtr-2b-sft-v2
Text Generation
•
3B
•
Updated
•
4
mradermacher/Gemma-medtr-2b-sft-v2-GGUF
3B
•
Updated
•
116
NaomiBTW/L3-8B-Lunaris-v1-GPTQ
Text Generation
•
Updated
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
7B
•
Updated
•
56
•
16
Rakushaking/llm-jp-3-13b-it
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1
Text Generation
•
7B
•
Updated
•
41
•
51
nisten/qwen2.5-coder-7b-abliterated-128k-AWQ
Text Generation
•
2B
•
Updated
•
4
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
•
7B
•
Updated
•
28
•
16
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3
Text Generation
•
7B
•
Updated
•
15
•
14
mlx-community/Qwen2.5-7B-Instruct-kowiki-qa-4bit
Text Generation
•
1B
•
Updated
•
5
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
2B
•
Updated
•
13
•
3
adriabama06/SmallThinker-3B-Preview-AWQ
Text Generation
•
Updated
•
5
•
1
exxocism/Linkbricks-Horizon-AI-Llama-3.3-Korean-70B-sft-dpo-GGUF
Text Generation
•
Updated
ehristoforu/Phi4-MoE-2x14B-Instruct
Text Generation
•
14B
•
Updated
•
6
ModelCloud/Qwen2.5-0.5B-Instruct-gptqmodel-w4a16
Text Generation
•
0.5B
•
Updated
•
20
•
1
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1
Text Generation
•
2B
•
Updated
•
14
•
5
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2
Text Generation
•
2B
•
Updated
•
115
•
7
vital-ai/watt-tool-70B-awq
11B
•
Updated
•
3
•
4
curiousmind147/microsoft-phi-4-AWQ-4bit-GEMM
Text Generation
•
3B
•
Updated
•
234
•
1
ConfidentialMind/Mistral-Small-24B-Instruct-2501_GPTQ_G128_W4A16_MSE
Text Classification
•
4B
•
Updated
•
29
•
1
ConfidentialMind/Virtuoso-Medium-v2_GPTQ_G128_W4A16
Text Generation
•
6B
•
Updated
•
3
ConfidentialMind/Virtuoso-Medium-v2_GPTQ_G32_W4A16
Text Generation
•
7B
•
Updated
•
15