Models: 361

Active filters: 4bit

Hariprasath28/orpheus-3b-4bit-AWQ

Text Generation • 0.9B • Updated Jul 19 • 5

codemajesty/gemma2b-4bit-quantized

Text Generation • 2B • Updated Jul 22 • 2

Irfanuruchi/1B-building-engineering-llm

0.6B • Updated Jul 25 • 2

Irfanuruchi/Llama-3.2-3B-Computer-Engineering-LLM

2B • Updated Jul 25 • 2

jw-sohn/Llama-3.1-8B-Instruct-nf4

Text Generation • 8B • Updated Aug 17 • 1

btbtyler09/Qwen3-30B-A3B-Instruct-2507-gptq-4bit

Text Generation • 5B • Updated Jul 31 • 163 • 3

VincentGOURBIN/voxtral-small-4bit-mixed

25B • Updated Jul 31 • 69

sp-embraceable/e2-llama-v3p3-70B-Merged-v1-LQ

Text Generation • Updated Jul 31

YijingOlivia/llama3-movie-rating-lora

Jacaranda-Health/ASR-STT-4bit

Automatic Speech Recognition • 0.4B • Updated Aug 15 • 1

Seonghaa/CalMate-20B-KO-LoRA

Text Generation • Updated Aug 13

tahamajs/Qwen3-4b-gsm8k-Qlora-SFT

Text Generation • Updated Aug 17 • 30 • 1

tahamajs/Qwen3-4b-gsm8k-Qlora-GRPO

Text Generation • Updated Aug 17 • 9 • 1

sweatSmile/Gemma-3-270m-Buddha-QA

Question Answering • Updated Aug 18 • 1 • 1

Jacaranda-Health/Whisper-Turbo-4bit

Automatic Speech Recognition • 0.5B • Updated Aug 31 • 3

sweatSmile/Qwen3-0.6B-4bit-NEET

Question Answering • 0.6B • Updated Aug 22 • 1

analystgatitu/economist_model_v2

Text Generation • 2B • Updated Aug 22 • 14 • 1

Rakushaking/unsloth-gpt-oss-jp-finetuned

12B • Updated Aug 24 • 20

arkaprovob/medgemma-4b-it-mlx-4bit

0.9B • Updated Aug 25 • 29

trainfarren/john-welbourne-csm-1b-4bit

Text-to-Speech • 1B • Updated Aug 26

BennyDaBall/Mistral-44B-MoE-Patched-MLX-4bit-G64

Text Generation • 44B • Updated Sep 10 • 76

benyamini/DeepSeek-R1-Distill-Llama-8B-AWQ-w4g128

NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-nf4-dq

Image-Text-to-Text • 8B • Updated Aug 29

NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-nf4

Image-Text-to-Text • 8B • Updated Aug 29

NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-fp4

Image-Text-to-Text • 8B • Updated Aug 29 • 1

benyamini/DSR1-8B-llmc-awq-w4

Text Generation • 2B • Updated Aug 29 • 11

Sai2076/LLLMA_FINETUNED_PROJEN

edge-inference/DSR1-8B-llmc-awq-w4

Text Generation • 2B • Updated Aug 30 • 10

edge-inference/DSR1-14B-llmc-awq-w4

Text Generation • 3B • Updated Aug 30 • 11

edge-inference/DSR1-32B-llmc-awq-w4

Text Generation • 6B • Updated Aug 30 • 11