-
-
-
-
-
-
Inference Providers
Active filters:
vLLM
Image-Text-to-Text
•
17B
•
Updated
•
995
•
19
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
•
36B
•
Updated
•
144
•
4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
•
36B
•
Updated
•
37
•
5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
•
34B
•
Updated
•
7
•
3
amakhov/tiny-random-llama
Text Generation
•
4.18M
•
Updated
•
25
Text Generation
•
41B
•
Updated
•
10
•
2
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
•
485B
•
Updated
•
811
•
5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
•
286B
•
Updated
•
71
•
1
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
•
684B
•
Updated
•
18
•
3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
•
4B
•
Updated
•
1.7k
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
185
•
1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
•
4B
•
Updated
•
235
•
2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
•
31B
•
Updated
•
7.32k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
30
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
•
31B
•
Updated
•
48
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
10
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
52
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
•
8B
•
Updated
•
10
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
8
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
•
36B
•
Updated
•
8
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
•
36B
•
Updated
•
5
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
•
236B
•
Updated
•
1.77k
•
11
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
•
Updated
•
33
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
•
236B
•
Updated
•
495
•
6
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
•
236B
•
Updated
•
94
QuantTrio/DeepSeek-V3.2-Exp-AWQ
Text Generation
•
486B
•
Updated
•
57
•
4
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation
•
685B
•
Updated
•
79
•
4
Text Generation
•
50B
•
Updated
•
3.95k
•
5
QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation
•
69B
•
Updated
•
235
•
4
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
•
31B
•
Updated
•
564k
•
34