Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

40,940

Full-text search

Active filters: 4-bit

0xSero/GLM-4.7-REAP-50-W4A16

Text Generation • 2B • Updated about 16 hours ago • 635 • 37

Intel/GLM-4.7-int4-mixed-AutoRound

Text Generation • 2B • Updated 6 days ago • 134 • 21

0xSero/MiniMax-M2.1-REAP-50-W4A16

Text Generation • 17B • Updated 1 day ago • 209 • 19

mlx-community/GLM-4.7-REAP-50-mxfp4

Text Generation • 185B • Updated 3 days ago • 527 • 18

tencent/HY-MT1.5-1.8B-GPTQ-Int4

Translation • 2B • Updated 5 days ago • 563 • 10

tencent/HY-MT1.5-7B-GPTQ-Int4

Translation • 8B • Updated 5 days ago • 336 • 6

QuantTrio/GLM-4.7-AWQ

Text Generation • 358B • Updated 8 days ago • 14.6k • 14

QuantTrio/MiniMax-M2.1-AWQ

Text Generation • 229B • Updated 7 days ago • 1.9k • 8

mlx-community/IQuest-Coder-V1-40B-Loop-Instruct-4bit

Text Generation • 40B • Updated 1 day ago • 670 • 3

0xSero/GLM-4.7-REAP-40-W4A16

Text Generation • 2B • Updated 2 days ago • 520 • 3

MaziyarPanahi/gemma-7b-GGUF

Text Generation • 9B • Updated Feb 29, 2024 • 1.28k • 13

lmstudio-community/Qwen2.5-Coder-7B-Instruct-MLX-4bit

Text Generation • 1B • Updated Nov 13, 2024 • 2.02k • 3

unsloth/Qwen3-8B-unsloth-bnb-4bit

8B • Updated May 13, 2025 • 218k • 13

stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ

Text Generation • 8B • Updated Jun 4, 2025 • 5.57k • 4

mlx-community/Qwen3-Next-80B-A3B-Instruct-4bit

Text Generation • Updated Sep 12, 2025 • 4.51k • 22

geoffmunn/Qwen3-1.7B-f16

Text Generation • 2B • Updated about 9 hours ago • 2.39k • 4

garrison/GLM-4.5-Air-REAP-82B-A12B-mlx-4Bit

Text Generation • 82B • Updated Nov 18, 2025 • 45 • 1

0xSero/GLM-4.6-REAP-218B-A32B-W4A16-AutoRound

Text Generation • 2B • Updated 2 days ago • 210 • 5

nota-ai/GLM-4.5-Air-NotaMoeQuant-Int4

Text Generation • 1B • Updated 8 days ago • 65 • 3

Disty0/Qwen-Image-Edit-2511-SDNQ-uint4-svd-r32

Image-to-Image • Updated 14 days ago • 381 • 7

mbakgun/Qwen2.5-Coder-14B-n8n-Workflow-Generator

Text Generation • 15B • Updated 8 days ago • 655 • 2

zimengxiong/WeDLM-8B-Instruct-MLX-4bit

Text Generation • 1B • Updated 4 days ago • 108 • 2

dakerholdings/HY-MT1.5-1.8B-mixed_4_6-mlx

Text Generation • 0.3B • Updated 2 days ago • 47 • 2

Intel/Qwen3-VL-30B-A3B-Instruct-int4-AutoRound

1B • Updated about 4 hours ago • 2

TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 158 • 86

TheBloke/WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ

Text Generation • 33B • Updated Aug 21, 2023 • 39 • 93

unsloth/mistral-7b-bnb-4bit

Text Generation • 7B • Updated Sep 11, 2024 • 11.6k • 30

MaziyarPanahi/SynthIA-7B-v1.3-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • 7B • Updated Jan 27, 2024 • 242 • 1

MaziyarPanahi/NSFW_DPO_Noromaid-7b-Mistral-7B-Instruct-v0.1-GGUF

Text Generation • 7B • Updated Jan 29, 2024 • 471 • 4

unsloth/tinyllama-chat-bnb-4bit

Text Generation • 1B • Updated Sep 3, 2024 • 5.82k • 6