Commit faba3b7 (verified) by suhara · 1 parent: d4c0ab3

Update README.md

Files changed (1)
  1. README.md +4 -0
README.md CHANGED

@@ -32,6 +32,10 @@ NVIDIA Nemotron-H-4B-Base-8K is a large language model (LLM) developed by NVIDIA
 
 For best performance on a given task, users are encouraged to customize the model using the NeMo Framework suite of customization tools, including Parameter-Efficient Fine-Tuning (P-tuning, Adapters, LoRA, and more), and Model Alignment (SFT, SteerLM, RLHF, and more) using [NeMo-Aligner](https://github.com/NVIDIA/NeMo-Aligner).
 
+The model was pruned and distilled from [Nemotron-H-8B-Base-8K](https://huggingface.co/nvidia/Nemotron-H-8B-Base-8K) using our hybrid language model compression technique and then fine-tuned into [Nemotron-H-4B-Instruct-128K](https://huggingface.co/nvidia/Nemotron-H-4B-Instruct-128K). For more details, please refer to the [paper](https://arxiv.org/abs/2504.11409).
+
+The paper has been accepted for publication at NeurIPS 2025.
+
 This model is for research and development only.
 
 ## License/Terms of Use
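
For context, below is a minimal sketch of loading the model described in this card with the Hugging Face Transformers library. The repository ID is inferred from the model name, and the dtype, device placement, and generation settings are illustrative assumptions rather than recommendations from the README:

```python
# Minimal sketch: load NVIDIA Nemotron-H-4B-Base-8K for text generation.
# The repo ID is inferred from this model card; dtype/device settings are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nvidia/Nemotron-H-4B-Base-8K"  # assumed repo ID, matching the card title
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # assumption: bf16 weights fit on a single modern GPU
    device_map="auto",
    trust_remote_code=True,       # in case the hybrid architecture needs custom code
)

prompt = "Large language models can be compressed by"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```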
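
The diff's customization paragraph points to the NeMo Framework for parameter-efficient fine-tuning; as a rough stand-in (the NeMo recipes depend on configuration details not shown here), here is a hedged LoRA sketch using the Hugging Face peft library, with target module names as unverified assumptions for this hybrid architecture:

```python
# Hedged LoRA sketch using Hugging Face peft as a stand-in for the NeMo Framework
# tooling mentioned in the card; target_modules are assumptions, not verified
# layer names for the Nemotron-H hybrid architecture.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "nvidia/Nemotron-H-4B-Base-8K", trust_remote_code=True
)
lora_config = LoraConfig(
    r=16,                                  # illustrative rank
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # assumption: attention projection names
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # confirm only adapter weights are trainable
```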