samirmsallem
/

gbert-base-coherence_evaluation

Text Classification

Model card Files Files and versions

samirmsallem commited on May 24

Commit

a4c999e

·

verified ·

1 Parent(s): e704632

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -35,7 +35,7 @@ model-index:
 It was trained using a custom annotated dataset of around 12,000 training and 3,000 test examples containing coherent and incoherent text sequences from wikipedia articles in german.
-Compared to the this model, the [large version](https://huggingface.co/samirmsallem/gbert-large-coherence_evaluation) achieved a slightly higher peak accuracy (95.30%) on the validation set, observed at epoch 7. However, the base model reached its lowest evaluation loss (0.2347) earlier during training, suggesting that it converges faster but may underperform slightly in terms of generalization. These findings can inform future model selection depending on whether inference efficiency or accuracy is prioritized.
 |Text Classification Tag| Text Classification Label | Description                             |

 It was trained using a custom annotated dataset of around 12,000 training and 3,000 test examples containing coherent and incoherent text sequences from wikipedia articles in german.
+Compared to this model, the [large version](https://huggingface.co/samirmsallem/gbert-large-coherence_evaluation) achieved a slightly higher peak accuracy (95.30%) on the validation set, observed at epoch 7. However, the base model reached its lowest evaluation loss (0.2347) earlier during training, suggesting that it converges faster but may underperform slightly in terms of generalization. These findings can inform future model selection depending on whether inference efficiency or accuracy is prioritized.
 |Text Classification Tag| Text Classification Label | Description                             |