Commit
·
26ee7e2
1
Parent(s):
2aefa2a
update readme
Browse files
README.md
CHANGED
|
@@ -24,7 +24,7 @@ Global <br>
|
|
| 24 |
Developers looking to take off the shelf pre-quantized models for deployment <br>
|
| 25 |
|
| 26 |
### Release Date: <br>
|
| 27 |
-
Huggingface 05/09/2025 via https://huggingface.co/nvidia/Llama-3.3-70B-Instruct-FP8 <br>
|
| 28 |
|
| 29 |
## Model Architecture:
|
| 30 |
**Architecture Type:** Transformer <br>
|
|
@@ -61,8 +61,8 @@ The model is quantized with nvidia-modelopt **v0.23.0** <br>
|
|
| 61 |
|
| 62 |
## Datasets:
|
| 63 |
* Calibration Dataset: [cnn_dailymail](https://huggingface.co/datasets/abisee/cnn_dailymail) <br>
|
| 64 |
-
** Data collection method:
|
| 65 |
-
** Labeling method:
|
| 66 |
|
| 67 |
|
| 68 |
## Inference:
|
|
|
|
| 24 |
Developers looking to take off the shelf pre-quantized models for deployment <br>
|
| 25 |
|
| 26 |
### Release Date: <br>
|
| 27 |
+
Huggingface 05/09/2025 via https://huggingface.co/nvidia/Llama-3.3-70B-Instruct-FP8 <br>
|
| 28 |
|
| 29 |
## Model Architecture:
|
| 30 |
**Architecture Type:** Transformer <br>
|
|
|
|
| 61 |
|
| 62 |
## Datasets:
|
| 63 |
* Calibration Dataset: [cnn_dailymail](https://huggingface.co/datasets/abisee/cnn_dailymail) <br>
|
| 64 |
+
** Data collection method: The dataset is collected by crawling and extracting news articles and their summaries from the CNN and Daily Mail websites. <br>
|
| 65 |
+
** Labeling method: The dataset is labeled by having the news articles as input and the corresponding highlight as the gold label summary. <br>
|
| 66 |
|
| 67 |
|
| 68 |
## Inference:
|