Commit 1b3f123 (verified) by PetterLee · 1 parent: 2f807f9

Model save

README.md CHANGED
@@ -39,12 +39,12 @@ The following hyperparameters were used during training:
  - train_batch_size: 4
  - eval_batch_size: 8
  - seed: 42
- - gradient_accumulation_steps: 8
- - total_train_batch_size: 32
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 16
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: constant
  - lr_scheduler_warmup_ratio: 0.03
- - num_epochs: 10
+ - num_epochs: 5
 
  ### Training results
 
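The changed values in this hunk are linked by simple arithmetic: `total_train_batch_size` is the per-device batch size multiplied by `gradient_accumulation_steps` (and by the number of devices). The sketch below reproduces the old and new totals. It assumes a single training device, which is consistent with 4 × 8 = 32 and 4 × 4 = 16 but is not stated in the README. The function names and the dataset size of 3200 samples are hypothetical, chosen only to illustrate the step count.

```python
import math


def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of samples that contribute to one optimizer step."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


def optimizer_steps(num_samples: int, batch_size: int, num_epochs: int) -> int:
    """Approximate optimizer steps over a run (rounding up per epoch)."""
    return num_epochs * math.ceil(num_samples / batch_size)


# Values from the diff; num_devices=1 is an assumption.
before = effective_batch_size(4, 8)  # old: gradient_accumulation_steps = 8
after = effective_batch_size(4, 4)   # new: gradient_accumulation_steps = 4

# Hypothetical dataset size, not taken from the README.
NUM_SAMPLES = 3200
steps_before = optimizer_steps(NUM_SAMPLES, before, num_epochs=10)
steps_after = optimizer_steps(NUM_SAMPLES, after, num_epochs=5)

print(before, after)
print(steps_before, steps_after)
```

Under these assumptions, halving both the effective batch size and the epoch count leaves the number of optimizer steps roughly unchanged, while each sample is seen half as many times.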
runs/Apr19_23-43-04_Hulk/events.out.tfevents.1745077386.Hulk.265296.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5245fdbd6b5f6871f1859fbe3c2846aae4ef7b78afe4cb8d20701cdca375dc69
- size 15284
+ oid sha256:ad32af8f59e82c683ca386afc30c927f0639f7997497ec0c0500d92fd2e6c129
+ size 15638
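
The TensorBoard event file is stored through Git LFS, so the diff only shows its pointer: a spec version, a content hash, and a byte size. As a minimal sketch, the pointer text can be parsed as space-separated key/value lines. The helper name `parse_lfs_pointer` is hypothetical and is not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents after this commit, copied from the diff.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ad32af8f59e82c683ca386afc30c927f0639f7997497ec0c0500d92fd2e6c129
size 15638
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["hash_algorithm"], pointer["size"])
```

The size field shows the event file grew from 15284 to 15638 bytes in this commit.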