Models: 1,532

Active filters: vllm

Maziko/Babelbit-5GbA3J

Text Generation • 120B • Updated Oct 7 • 2

mradermacher/Jinx-Qwen3-30B-A3B-Thinking-2507-GGUF

31B • Updated Oct 8 • 170

Wwayu/Jinx-Qwen3-30B-A3B-Thinking-2507-mlx-6Bit

Text Generation • 31B • Updated Oct 8 • 19

Wwayu/Jinx-Qwen3-30B-A3B-Thinking-2507-mlx-8Bit

Text Generation • 31B • Updated Oct 8 • 30

mradermacher/Jinx-Qwen3-30B-A3B-Thinking-2507-i1-GGUF

31B • Updated Oct 8 • 397 • 3

GaleneAI/Magistral-Small-2509-FP8-Dynamic

Updated Oct 8 • 21 • 2

groxaxo/Qwen3-32B-Uncensored-Autoround-int4

2B • Updated Oct 9 • 20 • 1

ncls-p/gpt-oss-120b-mlx-3Bit

Text Generation • 117B • Updated Oct 9 • 208

akshaykdeo/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF

7B • Updated Oct 10 • 5

TheHouseOfTheDude/Behemoth-ReduX-123B-v1.1_Compressed-Tensors

Text Generation • Updated Oct 12 • 4

Wwayu/QiMing-Strategist-20B-MXFP4-mlx-4Bit

Text Generation • 21B • Updated Oct 10 • 16

Wwayu/QiMing-Strategist-20B-MXFP4-mlx-6Bit

Text Generation • 21B • Updated Oct 10 • 13

Wwayu/QiMing-Strategist-20B-MXFP4-mlx-8Bit

Text Generation • 21B • Updated Oct 10 • 20

SiddhJagani/gpt-oss-20b-no-think-mlx-2Bit

Text Generation • 21B • Updated Oct 11 • 97 • 1

SiddhJagani/gpt-oss-20b-no-think-mlx-Q3

Text Generation • 21B • Updated Oct 16 • 54

grimjim/Mistral-Small-3.2-24B-Instruct-2506

Image-Text-to-Text • 24B • Updated Oct 20 • 13

Babsie/DeepHermes3_24B_textonly

24B • Updated about 1 month ago • 9 • 1

TheHouseOfTheDude/L3.3-70B-Animus-V12.0_Compressed-Tensors

Text Generation • Updated Oct 13 • 1

rockon1095/Mixtral-8x7B-Instruct-v0.1-Q4_0-GGUF

47B • Updated Oct 13 • 67

mkenfenheuer/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF

7B • Updated Oct 13 • 1

Bellesteck/Apriel-1.5-15b-Thinker-FP8-W8A8

Image-Text-to-Text • 14B • Updated Oct 13 • 2

amitkparekh/Mistral-Nemo-Graft-2407

12B • Updated Oct 13 • 1

amitkparekh/Mistral-Small-3.1-24B-Graft-2503

24B • Updated Oct 13 • 1

TheHouseOfTheDude/Behemoth-X-123B-v2.1_Compressed-Tensors

Text Generation • Updated 24 days ago • 69

SiddhJagani/gpt-oss-20b-no-think-mlx-Q4

Text Generation • 21B • Updated Oct 16 • 57

SiddhJagani/gpt-oss-20b-no-think-mlx-Q6

Text Generation • 21B • Updated Oct 16 • 54

SiddhJagani/gpt-oss-20b-no-think-mlx-Q8

Text Generation • 21B • Updated 26 days ago • 252 • 1

RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4

Text Generation • 133B • Updated 6 days ago • 3.81k • 2

roleplaiapp/mistral_fp8

Image-Text-to-Text • Updated Oct 16 • 12

fraseque/Llama-3.3-70B-FP8-Instruct-Neuron

Text Generation • 71B • Updated 24 days ago • 16