v0.32.0
See https://github.com/quic/ai-hub-models/releases/v0.32.0 for changelog.
LICENSE
ADDED
@@ -0,0 +1,2 @@
+The license of the original trained model can be found at https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct/blob/main/LICENSE.txt.
+The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found at https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct/blob/main/LICENSE.txt.
README.md
CHANGED
@@ -28,8 +28,6 @@ This model is an implementation of Llama-v3.2-3B-Instruct found [here](https://h
 - **Model Stats:**
   - Input sequence length for Prompt Processor: 128
   - Context length: 4096
-  - Number of parameters: 3B
-  - Model size: 2.4G
   - Precision: w4a16 + w8a16 (few layers)
   - Num of key-value heads: 8
   - Model-1 (Prompt Processor): Llama-PromptProcessor-Quantized