Update README.md
Browse files
README.md
CHANGED
|
@@ -32,7 +32,6 @@ We fine-tuned [google/gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it
|
|
| 32 |
|
| 33 |
- **Repository:** https://github.com/princeton-nlp/SimPO
|
| 34 |
- **Paper:** https://arxiv.org/pdf/2405.14734
|
| 35 |
-
- **Demo:** Soon to be alive
|
| 36 |
|
| 37 |
|
| 38 |
## How to Get Started with the Model
|
|
@@ -60,7 +59,7 @@ We use [princeton-nlp/gemma2-ultrafeedback-armorm](https://huggingface.co/datase
|
|
| 60 |
|
| 61 |
#### Training Hyperparameters
|
| 62 |
|
| 63 |
-
|
| 64 |
|
| 65 |
#### Speeds, Sizes, Times
|
| 66 |
|
|
|
|
| 32 |
|
| 33 |
- **Repository:** https://github.com/princeton-nlp/SimPO
|
| 34 |
- **Paper:** https://arxiv.org/pdf/2405.14734
|
|
|
|
| 35 |
|
| 36 |
|
| 37 |
## How to Get Started with the Model
|
|
|
|
| 59 |
|
| 60 |
#### Training Hyperparameters
|
| 61 |
|
| 62 |
+
The hyperparameters used can be found in the [training script](https://github.com/princeton-nlp/SimPO/blob/main/training_configs/gemma-2-9b-it-simpo.yaml).
|
| 63 |
|
| 64 |
#### Speeds, Sizes, Times
|
| 65 |
|