-
-
-
-
-
-
Inference Providers
Active filters:
torchao
gurro/llama-3.1-8B-torchao-int4wo-128
Text Generation
•
Updated
•
6
gurro/llama-3.1-8B-torchao-int4wo-256
Text Generation
•
Updated
•
9
jerryzh168/llama3-8b-autoquant
Text Generation
•
Updated
•
29
medmekk/Llama-3.1-8B-Instruct-torchao-int8_weight_only
medmekk/Llama-3.1-8B-Instruct-torchao-int8wo
medmekk/Llama-3.1-8B-Instruct-torchao-int8da8w
medmekk/Llama-3.2-3B-Instruct-torchao-int8wo
medmekk/Llama-3.2-1B-torchao-int8wo
medmekk/Llama-3.2-1B-torchao-int8da8w
medmekk/Llama-3.2-3B-Instruct-torchao-int8da8w
medmekk/Llama-3.1-70B-Instruct-torchao-int8da8w
jerryzh168/Meta-Llama-3-8B-torchao-int8_weight_only
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_64
HF-Quantization/Llama-3.2-1B-TORCHAO-W8
HF-Quantization/Llama-3.2-1B-TORCHAO-W8A8
HF-Quantization/Llama-3.2-1B-TORCHAO-W4
HF-Quantization/Llama-3.3-70B-Instruct-TORCHAO-W4
jpablomch/Meta-Llama-3-8B-Instruct-torchao
Text Generation
•
Updated
•
9
jerryzh168/llama3-8b-int4wo-128
Text Generation
•
Updated
•
12
jerryzh168/llama3-8b-int8wo
Text Generation
•
Updated
•
6
alpindale/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
•
Updated
•
7
Text Generation
•
Updated
•
10
drisspg/float8_dynamic_act_float8_weight-opt-125m
Text Generation
•
Updated
•
6
marksaroufim/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
•
Updated
•
7
Text Generation
•
Updated
•
6
Any-to-Any
•
Updated
•
7
jerryzh168/gemma3-4b-it-float8dq
Any-to-Any
•
Updated
•
7