Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
TsinghuaC3I
's Collections
SSRL
UltraMedical
SSRL
updated
Aug 18
Upvote
2
TsinghuaC3I/SSRL
Preview
•
Updated
Aug 5
•
29
•
2
TsinghuaC3I/Llama-3.1-8B-Instruct-SSRL
Text Generation
•
8B
•
Updated
Aug 5
•
12
TsinghuaC3I/Llama-3.2-3B-Instruct-SSRL
Text Generation
•
4B
•
Updated
Aug 5
•
14
TsinghuaC3I/Qwen2.5-7B-Instruct-SSRL
Text Generation
•
8B
•
Updated
Aug 5
•
9
TsinghuaC3I/Qwen2.5-3B-Instruct-SSRL
Text Generation
•
3B
•
Updated
Aug 5
•
11
•
1
SSRL: Self-Search Reinforcement Learning
Paper
•
2508.10874
•
Published
Aug 14
•
94
Upvote
2
Share collection
View history
Collection guide
Browse collections