hhdqirui/Qwen2-7B-Instruct-GRPO-8
Transformers
TensorBoard
Safetensors
AI-MO/NuminaMath-TIR
Generated from Trainer
trl
grpo
arxiv:2402.03300
421caa8
Qwen2-7B-Instruct-GRPO-8/adapter_model.safetensors
Commit History
Training in progress, step 226 | 2b5424c | verified | hhdqirui committed on Apr 28
Training in progress, step 220 | 1ee1117 | verified | hhdqirui committed on Apr 28
Training in progress, step 210 | b0e8abe | verified | hhdqirui committed on Apr 28
Training in progress, step 200 | 1210868 | verified | hhdqirui committed on Apr 28
Training in progress, step 190 | 9a59a09 | verified | hhdqirui committed on Apr 28
Training in progress, step 180 | e7a22e5 | verified | hhdqirui committed on Apr 28
Training in progress, step 170 | 0fef686 | verified | hhdqirui committed on Apr 28
Training in progress, step 160 | 33b568e | verified | hhdqirui committed on Apr 28
Training in progress, step 150 | 2ec596f | verified | hhdqirui committed on Apr 28
Training in progress, step 140 | 08829b3 | verified | hhdqirui committed on Apr 28
Training in progress, step 130 | 3d82301 | verified | hhdqirui committed on Apr 28
Training in progress, step 120 | fc8eeff | verified | hhdqirui committed on Apr 28
Training in progress, step 110 | f999921 | verified | hhdqirui committed on Apr 28
Training in progress, step 100 | 56346ca | verified | hhdqirui committed on Apr 28
Training in progress, step 90 | f7b806e | verified | hhdqirui committed on Apr 28
Training in progress, step 80 | 490a96d | verified | hhdqirui committed on Apr 28
Training in progress, step 70 | e0b366e | verified | hhdqirui committed on Apr 28
Training in progress, step 60 | 403d6ef | verified | hhdqirui committed on Apr 28
Training in progress, step 50 | 1f860b2 | verified | hhdqirui committed on Apr 28
Training in progress, step 40 | bb53af2 | verified | hhdqirui committed on Apr 28
Training in progress, step 30 | 2d70a40 | verified | hhdqirui committed on Apr 28
Training in progress, step 20 | 858c73e | verified | hhdqirui committed on Apr 28
Training in progress, step 10 | 961e9b3 | verified | hhdqirui committed on Apr 28