princeton-nlp
/

gemma-2-9b-it-SimPO

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

princeton-nlp commited on Jul 17, 2024

Commit

c09ba01

·

verified ·

1 Parent(s): d64181d

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -32,7 +32,6 @@ We fine-tuned [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it
 - **Repository:** https://github.com/princeton-nlp/SimPO
 - **Paper:** https://arxiv.org/pdf/2405.14734
-- **Demo:** Soon to be alive
 ## How to Get Started with the Model
@@ -60,7 +59,7 @@ We use [princeton-nlp/gemma2-ultrafeedback-armorm](https://huggingface.co/datase
 #### Training Hyperparameters
-[TO BE FILLED LATER]
 #### Speeds, Sizes, Times

 - **Repository:** https://github.com/princeton-nlp/SimPO
 - **Paper:** https://arxiv.org/pdf/2405.14734
 ## How to Get Started with the Model
 #### Training Hyperparameters
+The hyperparameters used can be found in the [training script](https://github.com/princeton-nlp/SimPO/blob/main/training_configs/gemma-2-9b-it-simpo.yaml).
 #### Speeds, Sizes, Times