BabyLM-community/nld-baseline-small

Files changed (6)

README.md CHANGED

@@ -13,6 +13,8 @@ should probably proofread and complete it, then remove this comment. -->
 # nld-baseline-small
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 3.1400
 ## Model description
@@ -37,10 +39,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 10
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 4.0996        | 1.0   | 8192  | 3.5003          |
+| 3.474         | 2.0   | 16384 | 3.2941          |
+| 3.3385        | 3.0   | 24576 | 3.2077          |
+| 3.2706        | 4.0   | 32768 | 3.1622          |
+| 3.2318        | 5.0   | 40960 | 3.1400          |
 ### Framework versions
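
The validation losses added to the README can be easier to compare when converted to perplexity. The following is a minimal sketch, not part of the commit. It assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual convention for language-model training but is not stated in the model card. The names `validation_loss` and `perplexity` are illustrative.

```python
import math

# Validation loss per epoch, copied from the training results table in the diff.
validation_loss = {1: 3.5003, 2: 3.2941, 3: 3.2077, 4: 3.1622, 5: 3.1400}


def perplexity(loss_nats: float) -> float:
    """Convert a mean cross-entropy (ASSUMED to be in nats) to perplexity."""
    return math.exp(loss_nats)


# Perplexity per epoch under the assumption above.
perplexities = {epoch: perplexity(loss) for epoch, loss in validation_loss.items()}

if __name__ == "__main__":
    for epoch, ppl in perplexities.items():
        print(f"epoch {epoch}: loss={validation_loss[epoch]:.4f} perplexity={ppl:.2f}")
```

If the loss were instead reported in bits, the base of the exponent would be 2 rather than e, so the assumption matters for the absolute values but not for the ordering of the epochs.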

merges.txt CHANGED

The diff for this file is too large to render.

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4e7e94a009700453fc4ef18ef13fa31ccf141e77f92723bf495ae0602d475840
+oid sha256:1e11dca7bd5acd4837f7f651ef70bc9d64be9ab568055cb1ad6f56c515671b47
 size 68273200
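
The three lines above are a Git LFS pointer: the `oid` is the SHA-256 of the real file and `size` is its length in bytes. The sketch below, which is not part of the commit, shows one way to check a downloaded file against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the pointer uses the `sha256` hash method shown in the diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer_text: str) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    method, _, expected_hex = fields["oid"].partition(":")
    if method != "sha256":
        raise ValueError(f"unsupported hash method: {method}")
    # Cheap check first: a size mismatch means the hash cannot match either.
    if path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex
```

For example, calling `matches_pointer(Path("model.safetensors"), pointer_text)` with the new pointer text would confirm that a local copy (the path here is an assumed download location) corresponds to this revision rather than the previous one.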

tokenizer.json CHANGED

The diff for this file is too large to render.

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:112fa1cba32b52e78334fc7f6dc865fae8fcdbcf8c2952e1181e8447bd897336
+oid sha256:15c79dadfb274f646887b8e29c5eaabb9e33359641894e1d079a64509321431c
 size 5777

vocab.json CHANGED

The diff for this file is too large to render.