Checkpoint from step 500, trained on the easy prompt set.

Safetensors · Model size: 8B params · Tensor type: BF16

Collection including RLHFlow/Qwen2.5-Math-7B-Reinforce-Ada-balance-easy