Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jw-sohn
/
Llama-3.1-8B-Instruct-nf4
like
0
Text Generation
Transformers
Safetensors
llama
nf4
4bit
quantization
conversational
text-generation-inference
4-bit precision
bitsandbytes
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
Llama-3.1-8B-Instruct-nf4
Model Description
Llama-3.1-8B-Instruct-nf4
Model Description
Llama-3.1-8B-Instruct quantized using 4-bit NF4 with double quantization.
Model type:
Causal Language Model
Downloads last month
5
Safetensors
Model size
8B params
Tensor type
F16
路
F32
路
U8
路
Chat template
Files info
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
馃檵
Ask for provider support