Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

359

Full-text search

Active filters: torchao

pytorch/gemma-3-12b-it-AWQ-INT4

Any-to-Any • Updated Oct 11 • 25.9k • 1

jerryzh168/llama3-int4wo-128

Updated Sep 13, 2024 • 6

medmekk/Meta-Llama-3-8B-quantized-int8_weight_only

Updated Oct 8, 2024 • 5

medmekk/Meta-Llama-3-8B-quantized-int8_dynamic_activation_int8_weight

Updated Oct 8, 2024 • 6

medmekk/Meta-Llama-3-8B-quantized-int4_weight_only

Updated Oct 8, 2024 • 8

medmekk/Meta-Llama-3-8B-quantized-int8_weight_only-2

Updated Oct 8, 2024 • 3

medmekk/Meta-Llama-3-8B-quantized-int4_weight_only-2

Updated Oct 8, 2024 • 5

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-64

Updated Oct 8, 2024 • 4

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-32

Updated Oct 8, 2024 • 7

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_256

Updated Oct 8, 2024 • 6

medmekk/Meta-Llama-3-8B-torchao-int8_weight_only

Updated Oct 9, 2024 • 13

medmekk/Meta-Llama-3-8B-torchao-int8_dynamic_activation_int8_weight

Updated Oct 8, 2024 • 6

medmekk/gpt2-torchao-int8_weight_only

Updated Oct 8, 2024 • 6

medmekk/Llama-3.1-70B-torchao-int8_weight_only

Updated Oct 8, 2024 • 7

medmekk/new_model

Updated Oct 17, 2024 • 3

medmekk/qsdf

Updated Oct 18, 2024 • 14

medmekk/new_gpt2

Updated Oct 18, 2024 • 6

medmekk/an_other_torchao

Updated Oct 18, 2024 • 14

medmekk/an_other_torchao_dynamic

Updated Oct 18, 2024 • 6

marcsun13/Meta-Llama-3-8B-torchao-int8_weight_only

Updated Oct 18, 2024 • 7

medmekk/new_tesing_model

Updated Oct 22, 2024 • 12

medmekk/testing_int4

Updated Oct 22, 2024 • 12

medmekk/quantized_int8_2

Updated Oct 22, 2024 • 7

medmekk/quantized_int4

Updated Oct 22, 2024 • 14

medmekk/quantized_70B

Updated Oct 22, 2024 • 5

medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128

Updated Oct 22, 2024 • 11

medmekk/custom_name

Updated Oct 22, 2024 • 12

medmekk/custom_name_1

Updated Oct 22, 2024 • 14

medmekk/deepseek-coder-1.3b-base-torchao-int8_weight_only

Updated Oct 22, 2024 • 4

medmekk/testing_repo_name

Updated Oct 22, 2024 • 4