RLVER/GRPO-non-thinking
Safetensors · qwen2 · arxiv:2507.03112 · License: license
RLVER committed on Jul 9
Commit afbf61c · verified · 1 Parent(s): b98824b
Update README.md
Files changed (1): README.md (+2 -1)
@@ -4,4 +4,5 @@ license_name: license
 license_link: LICENSE
 base_model:
 - Qwen/Qwen2.5-7B-Instruct
----
+---
+https://www.arxiv.org/abs/2507.03112