-
-
-
-
-
-
Inference Providers
Active filters:
torchao
metascroy/Qwen3-4B-int8-int4-unsloth
Text Generation
•
Updated
•
134
•
4
pytorch/gemma-3-12b-it-QAT-INT4
Image-Text-to-Text
•
Updated
•
40
•
3
pytorch/gemma-3-27b-it-AWQ-INT4
Image-Text-to-Text
•
Updated
•
3.61k
•
2
pytorch/Qwen3-8B-QAT-INT4
Text Generation
•
Updated
•
47
•
1
vinhnx90/Qwen3-4B-QAT-TorchAO-int4-torchao
Feature Extraction
•
Updated
•
13
•
1
jerryzh168/llama3-int4wo-128
Updated
medmekk/Meta-Llama-3-8B-quantized-int8_weight_only
Updated
medmekk/Meta-Llama-3-8B-quantized-int8_dynamic_activation_int8_weight
medmekk/Meta-Llama-3-8B-quantized-int4_weight_only
medmekk/Meta-Llama-3-8B-quantized-int8_weight_only-2
medmekk/Meta-Llama-3-8B-quantized-int4_weight_only-2
medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-64
medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs-32
Updated
medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_256
Updated
medmekk/Meta-Llama-3-8B-torchao-int8_weight_only
medmekk/Meta-Llama-3-8B-torchao-int8_dynamic_activation_int8_weight
medmekk/gpt2-torchao-int8_weight_only
medmekk/Llama-3.1-70B-torchao-int8_weight_only
Updated
medmekk/new_model
medmekk/qsdf
Updated
medmekk/new_gpt2
medmekk/an_other_torchao
Updated
medmekk/an_other_torchao_dynamic
marcsun13/Meta-Llama-3-8B-torchao-int8_weight_only
medmekk/new_tesing_model
medmekk/testing_int4
Updated
medmekk/quantized_int8_2
Updated
medmekk/quantized_int4
Updated
medmekk/quantized_70B
medmekk/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128