Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZhuofengLi's picture
6 17 10

ZhuofengLi PRO

ZhuofengLi
Mi6paulino's profile picture Arunsajeevan199303's profile picture plusn's profile picture
·
https://github.com/Zhuofeng-Li
  • zhuofengli96475
  • Zhuofeng-Li
  • zhuofeng-li-6a528626a

AI & ML interests

Agents, Reasoning LLMs/VLLMs, RL

Organizations

VerlTool's profile picture Hugging Face MCP Course's profile picture AgentFlow's profile picture

ZhuofengLi 's models 13

ZhuofengLi/torl-qwen2.5-7b-instruct

8B • Updated Sep 11 • 2

ZhuofengLi/octo-science-qwen2.5-7b-grpo-step-40-v2

2B • Updated Aug 3 • 6

ZhuofengLi/octo-search-qwen2.5-7b-grpo-155-step-v1

8B • Updated Jul 29 • 5

ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5

2B • Updated Jul 28 • 6

ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step

Text Generation • 8B • Updated Jul 14 • 5

ZhuofengLi/xlam-reason-lora-sft-1340-step

Text Generation • 3B • Updated Jul 13 • 6

ZhuofengLi/tool-n1-reason-lora-sft-800-step

Text Generation • 8B • Updated Jul 4 • 7

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct

Text Generation • 8B • Updated Mar 30 • 3

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct

Text Generation • 2B • Updated Mar 30

ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup

Text Generation • 2B • Updated Mar 28

ZhuofengLi/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 26

ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct-wo-warmup

Text Generation • 8B • Updated Mar 25

ZhuofengLi/SciBART-original

Updated Jul 4, 2024
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs