Update README.md
Browse files
README.md
CHANGED
|
@@ -8,5 +8,5 @@ base_model:
|
|
| 8 |
|
| 9 |
This model is fine-tuned from the tanliboy/zephyr-gemma-2-9b model using the SelectiveDPO algorithm on the Ultrafeedback_binarized dataset.
|
| 10 |
|
| 11 |
-
For the recipe to reproduce this model, please visit our
|
| 12 |
|
|
|
|
| 8 |
|
| 9 |
This model is fine-tuned from the tanliboy/zephyr-gemma-2-9b model using the SelectiveDPO algorithm on the Ultrafeedback_binarized dataset.
|
| 10 |
|
| 11 |
+
For the recipe to reproduce this model, please visit our [GitHub page](https://github.com/glorgao/SelectiveDPO).
|
| 12 |
|