RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16

Text Generation

6.47 GB

1 contributor

History: 3 commits

Latest commit: Create README.md (b5b4ceb, verified) by ekurtic, about 2 months ago

| File | Size | Last commit | Updated |
|---|---|---|---|
| .gitattributes | 1.57 kB | Upload folder using huggingface_hub | about 2 months ago |
| README.md | 6.05 kB | Create README.md | about 2 months ago |
| config.json | 2.45 kB | Upload folder using huggingface_hub | about 2 months ago |
| configuration_nemotron_h.py | 12.2 kB | Upload folder using huggingface_hub | about 2 months ago |
| generation_config.json | 158 Bytes | Upload folder using huggingface_hub | about 2 months ago |
| gsm8k_5shot.txt | 50.2 kB | Upload folder using huggingface_hub | about 2 months ago |
| model-00001-of-00002.safetensors | 4.97 GB (xet) | Upload folder using huggingface_hub | about 2 months ago |
| model-00002-of-00002.safetensors | 1.48 GB (xet) | Upload folder using huggingface_hub | about 2 months ago |
| model.safetensors.index.json | 49.2 kB | Upload folder using huggingface_hub | about 2 months ago |
| modeling_nemotron_h.py | 78.8 kB | Upload folder using huggingface_hub | about 2 months ago |
| nemotron_toolcall_parser_no_streaming.py | 3.72 kB | Upload folder using huggingface_hub | about 2 months ago |
| recipe.yaml | 680 Bytes | Upload folder using huggingface_hub | about 2 months ago |
| special_tokens_map.json | 422 Bytes | Upload folder using huggingface_hub | about 2 months ago |
| tokenizer.json | 17.1 MB (xet) | Upload folder using huggingface_hub | about 2 months ago |
| tokenizer_config.json | 181 kB | Upload folder using huggingface_hub | about 2 months ago |