FUfu99/DeepSeek-V3-1226-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 28 • 5
FUfu99/Open-Reasoner-Zero-7B-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 24 • 7
FUfu99/Qwen-2.5-32B-SimpleRL-Zoo-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 22 • 6
FUfu99/DeepSeek-R1-Zero-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 22 • 5
FUfu99/Qwen-2.5-14B-SimpleRL-Zoo-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 19 • 7
FUfu99/Qwen-2.5-7B-SimpleRL-Zoo-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 19 • 6
FUfu99/deepseek-math-7b-base-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Apr 10 • 7
FUfu99/Qwen2.5-1.5B-Distill-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Mar 25 • 6
FUfu99/Qwen2.5-Math-7B-Instruct-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Mar 20 • 3
FUfu99/Qwen2.5-7B-Instruct-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Mar 4 • 6
FUfu99/deepseek-math-7b-rl-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 25 • 5
FUfu99/DeepSeek-R1-Distill-Qwen-1.5B-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated Feb 25 • 5