Inference Providers
Active filters: open-r1
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
• 2B • Updated • 1
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 3
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
• 8B • Updated • 2
• 6
qorbanpour/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
• 8B • Updated Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
• 8B • Updated • 3
schwamaths/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• Updated ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
• 2B • Updated • 2
schwamaths/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
• Updated • 3
weltonwang88/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated Jiawen006/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated mradermacher/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-GGUF
2B • Updated • 28
AdAstraAbyssoque/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 5
JeffP111/Qwen2.5-3B-GRPO-Countdown
Text Generation
• 3B • Updated susumuota/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
susumuota/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
calledice666/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated DominicX/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated Loong-Ma/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
bushou/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated DeeLearning/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 10
KevinWugk/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated didao1234/Qwen2.5-Math-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated princepride/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated daltunay/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
herman66/Qwen2.5-0.5B-Open-R1-Distill
Text Generation
• 0.5B • Updated • 7
• 1
tenacioustommy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 3B • Updated Maker-0409/Qwen-2.5-7B-Simple-RL
Text Generation
• 8B • Updated • 5
whooray/Qwen2.5-1.5B-Open-R1-Distill-ko
Text Generation
• 2B • Updated • 4