ZhuofengLi
PRO
6 followers · 11 following
https://github.com/Zhuofeng-Li
zhuofengli96475
Zhuofeng-Li
zhuofeng-li-6a528626a
AI & ML interests
Agents, Reasoning LLMs/VLLMs, RL
Organizations
Papers
6
arxiv:2510.05592
arxiv:2509.22799
arxiv:2509.01055
arxiv:2505.20139
models
13
ZhuofengLi/torl-qwen2.5-7b-instruct • 8B • Updated Sep 11 • 1
ZhuofengLi/octo-science-qwen2.5-7b-grpo-step-40-v2 • 2B • Updated Aug 3 • 6
ZhuofengLi/octo-search-qwen2.5-7b-grpo-155-step-v1 • 8B • Updated Jul 29 • 5
ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5 • 2B • Updated Jul 28 • 6
ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step • Text Generation • 8B • Updated Jul 14 • 5
ZhuofengLi/xlam-reason-lora-sft-1340-step • Text Generation • 3B • Updated Jul 13 • 6
ZhuofengLi/tool-n1-reason-lora-sft-800-step • Text Generation • 8B • Updated Jul 4 • 4
ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct • Text Generation • 8B • Updated Mar 30 • 4
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct • Text Generation • 2B • Updated Mar 30
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup • Text Generation • 2B • Updated Mar 28
datasets
8
ZhuofengLi/sft_data • Viewer • Updated Sep 19 • 8.4k • 37
ZhuofengLi/gpqa_mcq • Viewer • Updated Jul 14 • 198 • 6
ZhuofengLi/Big-Math-RL-Verified • Viewer • Updated Mar 14 • 251k • 8
ZhuofengLi/rerank_public_dataset • Updated Nov 23, 2024 • 2
ZhuofengLi/TEG-Datasets • Preview • Updated Oct 29, 2024 • 208 • 4
ZhuofengLi/citation-network • Preview • Updated Oct 19, 2024 • 11
ZhuofengLi/MDS • Viewer • Updated Oct 16, 2024 • 97.5k • 8
ZhuofengLi/survey-sections-2k • Viewer • Updated Aug 2, 2024 • 2k • 7