Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
inferencerlabs
/
NVIDIA-Nemotron-3-Super-120B-A12B-MLX-4.5bit
like
4
Text Generation
MLX
Safetensors
English
nemotron_h
quantized
conversational
custom_code
4-bit precision
Model card
Files
Files and versions
xet
Community
3
Use this model
main
NVIDIA-Nemotron-3-Super-120B-A12B-MLX-4.5bit
68 GB
1 contributor
History:
30 commits
inferencerlabs
Update README.md
cc8020e
verified
2 days ago
.gitattributes
Safe
1.57 kB
Upload model file
4 days ago
LICENSE
Safe
10 kB
Upload model file
4 days ago
README.md
Safe
2.61 kB
Update README.md
2 days ago
__init__.py
Safe
0 Bytes
Upload model file
4 days ago
chat_template.jinja
Safe
10.8 kB
Upload model file
4 days ago
config.json
Safe
2.33 kB
Upload model file
4 days ago
configuration_nemotron_h.py
Safe
19.8 kB
Upload model file
4 days ago
generation_config.json
Safe
210 Bytes
Upload model file
4 days ago
model-00001-of-00007.safetensors
Safe
10.5 GB
xet
Upload model file
4 days ago
model-00002-of-00007.safetensors
Safe
10.1 GB
xet
Upload model file
4 days ago
model-00003-of-00007.safetensors
Safe
10.1 GB
xet
Upload model file
4 days ago
model-00004-of-00007.safetensors
Safe
10.1 GB
xet
Upload model file
4 days ago
model-00005-of-00007.safetensors
Safe
10.1 GB
xet
Upload model file
4 days ago
model-00006-of-00007.safetensors
Safe
10.1 GB
xet
Upload model file
4 days ago
model-00007-of-00007.safetensors
Safe
6.96 GB
xet
Upload model file
4 days ago
model.safetensors.index.json
Safe
133 kB
Upload model file
4 days ago
modeling_nemotron_h.py
Safe
82.3 kB
Upload model file
4 days ago
super_v3_reasoning_parser.py
Safe
1.88 kB
Upload model file
4 days ago
tokenizer.json
Safe
17.1 MB
xet
Upload model file
4 days ago
tokenizer_config.json
Safe
439 Bytes
Upload model file
4 days ago