Granite 4.0 H-Small (FP8)

📣 Update [10-07-2025]: Added a default system prompt to the chat template to guide the model towards more professional, accurate, and safe responses.

This repository contains the FP8 version of Granite-4.0-H-Small.

Please refer to the the original instruct model's model card for additional details: https://huggingface.co/ibm-granite/granite-4.0-h-small

Downloads last month
2,704
Safetensors
Model size
33B params
Tensor type
BF16
·
F8_E4M3
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ibm-granite/granite-4.0-h-small-FP8

Quantized
(25)
this model

Collection including ibm-granite/granite-4.0-h-small-FP8