Update README.md

Files changed (1)

README.md

@@ -8,5 +8,5 @@ base_model:
 This model is fine-tuned from the tanliboy/zephyr-gemma-2-9b model using the SelectiveDPO algorithm on the Ultrafeedback_binarized dataset.
-For the recipe to reproduce this model, please visit our GitHub page.
+For the recipe to reproduce this model, please visit our [GitHub page](https://github.com/glorgao/SelectiveDPO).