MuntasirHossain
/

flan-t5-base-dialogsum-summarization

@@ -3,6 +3,8 @@ license: apache-2.0
 base_model: google/flan-t5-base
 tags:
 - generated_from_trainer
 model-index:
 - name: flan-t5-base-dialogsum-summarization
   results: []
@@ -15,17 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 1.2399
-- eval_rouge1: 38.7981
-- eval_rouge2: 14.9183
-- eval_rougeL: 32.7218
-- eval_rougeLsum: 34.5266
-- eval_gen_len: 18.896
-- eval_runtime: 195.7294
-- eval_samples_per_second: 7.664
-- eval_steps_per_second: 0.639
-- epoch: 1.0
-- step: 1039
 ## Model description
@@ -45,12 +42,23 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 12
-- eval_batch_size: 12
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 4
 ### Framework versions

 base_model: google/flan-t5-base
 tags:
 - generated_from_trainer
+metrics:
+- rouge
 model-index:
 - name: flan-t5-base-dialogsum-summarization
   results: []
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2095
+- Rouge1: 39.3212
+- Rouge2: 15.6335
+- Rougel: 33.4773
+- Rougelsum: 35.1795
+- Gen Len: 18.872
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 4
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 1.1318        | 1.0   | 1558 | 1.2331          | 39.1301 | 15.2555 | 33.1115 | 35.0288   | 18.868  |
+| 1.0483        | 2.0   | 3116 | 1.2095          | 39.3212 | 15.6335 | 33.4773 | 35.1795   | 18.872  |
+| 0.9969        | 3.0   | 4674 | 1.2104          | 40.0115 | 16.029  | 34.0364 | 35.8358   | 18.852  |
+| 0.9601        | 4.0   | 6232 | 1.2161          | 39.7403 | 15.9708 | 33.8644 | 35.5952   | 18.868  |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:471ee2a28705908776ab14887d5b098eef4113560ccc06f4eff7830f9770eb4a
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:cda31816de7bfe2f43c47062e20851af21d3f9688e312dbf035f7af93d974631
 size 990345064

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e93315546d8b68c31e5ba89f92b8ee8776de16ba9e4891041984b70b2b532029
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:7ce60c67e57455efc07032c2725c711b7855837744b69957484ca5f73baae9dc
 size 5048