-
-
-
-
-
-
Inference Providers
Active filters:
torchao
medmekk/Llama-3.2-1B-ao-float8da8w
Text Generation
•
Updated
•
5
medmekk/Llama-3.2-1B-ao-autoquant-1
Text Generation
•
Updated
•
3
medmekk/Llama-3.2-1B-ao-float8wo-2
Text Generation
•
Updated
•
3
medmekk/Llama-3.2-1B-ao-float8wo-3
Text Generation
•
Updated
•
5
medmekk/Llama-3.2-1B-ao-int8wo-gs256
Text Generation
•
Updated
•
5
medmekk/Llama-3.2-1B-ao-int4wo-gs128
Text Generation
•
Updated
•
3
medmekk/Qwen2.5-0.5B-Instruct-ao-float8wo
Text Generation
•
Updated
•
4
medmekk/Llama-3.2-1B-ao-int4wo-gs256
Text Generation
•
Updated
•
3
medmekk/Qwen2.5-VL-7B-Instruct-ao-float8wo
medmekk/Qwen2.5-VL-7B-Instruct-ao-int8wo
medmekk/Llama-3.1-8B-Instruct-ao-int8wo
Text Generation
•
Updated
•
5
medmekk/Qwen2.5-VL-7B-Instruct-ao-int8da8w8
medmekk/Llama-3.1-8B-Instruct-ao-autoquant
Text Generation
•
Updated
•
3
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs128
Text Generation
•
Updated
•
5
medmekk/Llama-3.1-8B-Instruct-ao-float8wo
Text Generation
•
Updated
•
4
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8
Text Generation
•
Updated
•
3
medmekk/Llama-3.1-8B-Instruct-ao-int8da8w8
Text Generation
•
Updated
•
6
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8-2
Text Generation
•
Updated
•
3
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs32
Text Generation
•
Updated
•
4
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs16
Text Generation
•
Updated
•
3
Erland/vanilla-340M-4096-model-AO-W4
Text Generation
•
Updated
•
4
irresistiblegrace97/TinyLlama-1.1B-Chat-v1.0-torchao-int4_weight_only-gs_4096
Erland/softpick-340M-4096-model-AO-W4
Text Generation
•
Updated
•
6
Erland/softpick-340M-4096-model-AO-W4A4
Text Generation
•
Updated
•
6
Erland/vanilla-340M-4096-model-AO-W4A4
Text Generation
•
Updated
•
5
irresistiblegrace97/tinyllama.gguf
jerryzh168/opt-125m-int4wo
Text Generation
•
Updated
•
5
Text Generation
•
Updated
•
314
•
2
Text Generation
•
Updated
•
128
jerryzh168/opt-125m-int4wo-per-module
Text Generation
•
Updated
•
2.08k