Zidi Xiong
polaris-73
1 follower · 2 following
AI & ML interests
None yet
Recent Activity
Published an article about 24 hours ago: Budget Alignment: Making Models Reason in the User’s Language
Updated a model 19 days ago: polaris-73/ds1p5b_grpo_ifeval_skywork_continue-global_step_400
Published a model 19 days ago: polaris-73/ds1p5b_grpo_ifeval_skywork_continue-global_step_400
polaris-73's models (91)
polaris-73/ds8b_grpo_math_gsm8k-global_step_200 • 8B • Updated Aug 12
polaris-73/ds8b_grpo_math_gsm8k-global_step_100 • 8B • Updated Aug 12
polaris-73/ds1p5b_grpo_skywork-global_step_1200 • 2B • Updated Aug 1 • 7
polaris-73/ds1p5b_grpo_skywork-global_step_1000 • 2B • Updated Aug 1 • 3
polaris-73/ds1p5b_grpo_skywork-global_step_800 • 2B • Updated Aug 1 • 6
polaris-73/ds1p5b_grpo_skywork-global_step_600 • 2B • Updated Aug 1 • 6
polaris-73/ds1p5b_grpo_skywork-global_step_400 • 2B • Updated Aug 1 • 3
polaris-73/ds1p5b_grpo_skywork-global_step_200 • 2B • Updated Aug 1 • 3
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_1200 • 2B • Updated Aug 1 • 5
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_1000 • 2B • Updated Aug 1 • 6
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_800 • 2B • Updated Aug 1 • 3
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_600 • 2B • Updated Aug 1 • 4
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_400 • 2B • Updated Aug 1 • 6
polaris-73/ds1p5b_grpo_skywork_cliphigh-global_step_200 • 2B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_faithful-global_step_870 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_faithful-global_step_800 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_faithful-global_step_600 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_faithful-global_step_400 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_faithful-global_step_200 • 8B • Updated Aug 1 • 6
polaris-73/qwen7b_grpo_math_gsm8k_cliphigh-global_step_870 • 8B • Updated Aug 1 • 3
polaris-73/qwen7b_grpo_math_gsm8k_cliphigh-global_step_800 • 8B • Updated Aug 1 • 6
polaris-73/qwen7b_grpo_math_gsm8k_cliphigh-global_step_600 • 8B • Updated Aug 1 • 5
polaris-73/qwen7b_grpo_math_gsm8k_cliphigh-global_step_400 • 8B • Updated Aug 1 • 6
polaris-73/qwen7b_grpo_math_gsm8k_cliphigh-global_step_200 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_rloo-global_step_870 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_rloo-global_step_800 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_rloo-global_step_600 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_rloo-global_step_400 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_rloo-global_step_200 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_reinforce-global_step_870 • 8B • Updated Aug 1 • 3