RLHFlow/Qwen2.5-Math-7B-Reinforce-Ada-balance-easy


baohao committed about 1 month ago

Commit 24b42ff · verified · 1 Parent(s): 3f96fb7

Update README.md

Files changed (1)

README.md +4 -3


@@ -1,3 +1,4 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+Checkpoint from step=500 and trained on the [easy prompt set](https://huggingface.co/datasets/RLHFlow/reinforce_ada_easy_prompt).