Models

132

Full-text search

Active filters: quark

superbigtree/Mistral-Nemo-Instruct-2407-FP8_aq

12B • Updated Apr 22, 2025 • 2

aigdat/Llama-3.2-1B-Instruct-awq-uint4-float16

0.4B • Updated Apr 29, 2025 • 2

aigdat/Llama-3.2-3B-Instruct-awq-uint4-float16

0.8B • Updated Apr 24, 2025

aigdat/Phi-3.5-mini-instruct-awq-uint4-float16

0.6B • Updated Apr 29, 2025

aigdat/DeepSeek-R1-Distill-Qwen-1.5B_quantized_int4_bfloat16

0.4B • Updated Apr 29, 2025

aigdat/Qwen3-0.6B_quantized_int4_float16

0.2B • Updated Apr 30, 2025 • 4

aigdat/Arch-Function-Chat-3B_quantized_int4_float16

0.7B • Updated May 5, 2025 • 1

aigdat/DeepCoder-14B-Preview_quantized_int4_float16

3B • Updated May 5, 2025 • 1

aigdat/Qwen2.5-Coder-1.5B-Instruct_quantized_int4_bfloat16

0.4B • Updated May 5, 2025

aigdat/Qwen2.5-Coder-7B-Instruct_quantized_int4_bfloat16

1B • Updated May 6, 2025

aigdat/Qwen2.5-3B-Instruct_quantized_int4_bfloat16

0.7B • Updated May 8, 2025 • 10

aigdat/Qwen2.5-Coder-32B-Instruct_quantized_int4_bfloat16

5B • Updated May 9, 2025 • 2

aigdat/Llama-xLAM-2-8b-fc-r_quantized_int4_bfloat16

2B • Updated May 9, 2025

fxmarty/qwen_1.5-moe-a2.7b-mxfp4

8B • Updated May 13, 2025 • 4.03k

amd/Llama-3.3-70B-Instruct-MXFP4-Preview

38B • Updated Nov 6, 2025 • 4.11k • 2

fxmarty/deepseek_r1_3_layers_mxfp4

8B • Updated May 15, 2025 • 441 • 1

fxmarty/Llama-4-Scout-17B-16E-Instruct-2-layers-mxfp4

5B • Updated Oct 6, 2025 • 2.73k • 1

amd/DeepSeek-R1-MXFP4

371B • Updated 25 days ago • 33.3k • 5

mohitsha/Llama-2-7b-hf-w_mx_fp4_per_group_sym

4B • Updated May 23, 2025

amd/Llama-3.1-405B-Instruct-MXFP4-Preview

218B • Updated Nov 6, 2025 • 482 • 1

amd/DeepSeek-R1-MXFP4-ASQ

363B • Updated Nov 6, 2025 • 19 • 1

haoyang-amd/qwen1.5-0.5B-ptpc

0.5B • Updated Jul 1, 2025

amd/DeepSeek-R1-0528-MXFP4

356B • Updated 26 days ago • 10.1k • 1

fxmarty/Llama-3.1-70B-Instruct-2-layers-mxfp6

3B • Updated Jul 9, 2025 • 2.83k

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp4_a_fp6_e2m3

8B • Updated Jul 11, 2025 • 3.81k

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp6_e2m3_a_fp6_e2m3

11B • Updated Jul 11, 2025 • 1

fxmarty/qwen1.5_moe_a2.7b_chat_w_fp6_e3m2_a_fp6_e3m2

11B • Updated Jul 11, 2025 • 4.15k

amd/Llama-2-70b-chat-hf-WMXFP4-AMXFP4-KVFP8-Scale-UINT8-MLPerf-GPTQ

37B • Updated Aug 5, 2025 • 2

sudhab1988/rakuten-7b-awq-g128-int4-asym-fp16-hf

1B • Updated Jul 15, 2025

matmelis/Llama_3.2_1B_w_uint4_gptq

0.4B • Updated Jul 16, 2025