Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
8
31
Peidong Wang
WDong
Follow
21world's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
15 days ago
HI-TransPA: Hearing Impairments Translation Personal Assistant
updated
a dataset
2 months ago
WDong/verl-2step-dataset
updated
a model
2 months ago
WDong/verl-2step-model
View all activity
Organizations
WDong
's models
25
Sort: Recently updated
WDong/verl-2step-model
3B
•
Updated
Sep 24
•
2
WDong/verl-16step-model
3B
•
Updated
Sep 24
•
2
WDong/dpo_0625_iter2_after_dpo_0.6
Updated
Jun 28, 2024
•
3
WDong/sft_06221544_policy2
Updated
Jun 28, 2024
•
1
WDong/sft_0626_after_2_dpo_9
Updated
Jun 28, 2024
•
4
WDong/sft_0622_policy2
Updated
Jun 28, 2024
•
2
WDong/dpo_06230018_policy2_0.6
Updated
Jun 28, 2024
•
3
WDong/dpo_06230018_policy2_0.01
Updated
Jun 28, 2024
•
4
WDong/dpo_06221544_policy2
Updated
Jun 28, 2024
•
2
WDong/dpo_0622_policy2
Updated
Jun 28, 2024
•
1
WDong/dpo_0621
Updated
Jun 28, 2024
•
2
WDong/Qwen2-7B-Instruct-dpo-06230018-policy2-0.6
Text Generation
•
8B
•
Updated
Jun 24, 2024
•
3
WDong/lora_06072000
Updated
Jun 8, 2024
•
4
WDong/7B_lora_06051615
Updated
Jun 8, 2024
•
2
WDong/Qwen1.5-7B-sft-0506_9_8
Text Generation
•
8B
•
Updated
May 7, 2024
•
6
WDong/Qwen1.5-7B-sft-0506_7_7
Text Generation
•
8B
•
Updated
May 6, 2024
•
7
WDong/Qwen1.5-7B-sft-0502
Text Generation
•
8B
•
Updated
May 2, 2024
•
4
WDong/7B-0428
Text Generation
•
8B
•
Updated
Apr 28, 2024
•
5
WDong/Qwen1.5-7B-SFT-0425
Updated
Apr 25, 2024
WDong/qwen1.5-1.8B-seed-sft
Text Generation
•
2B
•
Updated
Apr 22, 2024
•
7
•
WDong/CartPole
Reinforcement Learning
•
Updated
Mar 18, 2024
WDong/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Mar 13, 2024
•
9
WDong/Taxi-v3
Reinforcement Learning
•
Updated
Mar 13, 2024
WDong/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Mar 13, 2024
WDong/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 10, 2024
•
9