Peidong Wang's picture

2 8 31

Peidong Wang

WDong

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

HI-TransPA: Hearing Impairments Translation Personal Assistant

updated a dataset 2 months ago

WDong/verl-2step-dataset

updated a model 2 months ago

WDong/verl-2step-model

View all activity

Organizations

WDong 's models 25

WDong/verl-2step-model

3B • Updated Sep 24 • 2

WDong/verl-16step-model

3B • Updated Sep 24 • 2

WDong/dpo_0625_iter2_after_dpo_0.6

Updated Jun 28, 2024 • 3

WDong/sft_06221544_policy2

Updated Jun 28, 2024 • 1

WDong/sft_0626_after_2_dpo_9

Updated Jun 28, 2024 • 4

WDong/sft_0622_policy2

Updated Jun 28, 2024 • 2

WDong/dpo_06230018_policy2_0.6

Updated Jun 28, 2024 • 3

WDong/dpo_06230018_policy2_0.01

Updated Jun 28, 2024 • 4

WDong/dpo_06221544_policy2

Updated Jun 28, 2024 • 2

WDong/dpo_0622_policy2

Updated Jun 28, 2024 • 1

WDong/dpo_0621

Updated Jun 28, 2024 • 2

WDong/Qwen2-7B-Instruct-dpo-06230018-policy2-0.6

Text Generation • 8B • Updated Jun 24, 2024 • 3

WDong/lora_06072000

Updated Jun 8, 2024 • 4

WDong/7B_lora_06051615

Updated Jun 8, 2024 • 2

WDong/Qwen1.5-7B-sft-0506_9_8

Text Generation • 8B • Updated May 7, 2024 • 6

WDong/Qwen1.5-7B-sft-0506_7_7

Text Generation • 8B • Updated May 6, 2024 • 7

WDong/Qwen1.5-7B-sft-0502

Text Generation • 8B • Updated May 2, 2024 • 4

WDong/7B-0428

Text Generation • 8B • Updated Apr 28, 2024 • 5

WDong/Qwen1.5-7B-SFT-0425

Updated Apr 25, 2024

WDong/qwen1.5-1.8B-seed-sft

Text Generation • 2B • Updated Apr 22, 2024 • 7 •

WDong/CartPole

Reinforcement Learning • Updated Mar 18, 2024

WDong/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Mar 13, 2024 • 9

WDong/Taxi-v3

Reinforcement Learning • Updated Mar 13, 2024

WDong/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Mar 13, 2024

WDong/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 10, 2024 • 9