Llama-3.1-8B-Instruct-nf4

Model Description

Llama-3.1-8B-Instruct quantized using 4-bit NF4 with double quantization.

  • Model type: Causal Language Model
Downloads last month
5
Safetensors
Model size
8B params
Tensor type
F16
F32
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support