Spiral RL

community
https://github.com/spiral-rl/spiral
spiral-rl

Recent Activity

simonycl  authored a paper 11 days ago
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
Benjamin-eecs  authored a paper 13 days ago
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
Benjamin-eecs  authored a paper 16 days ago
Agent Learning via Early Experience

Team members: Leon Guertler, Bo Liu, Simon Yu, Zichen

spiral-rl's collections (1)

SPIRAL
  • spiral-rl/Spiral-Qwen3-4B

    Text Generation • 4B • Updated Jul 5 • 9 • 4
  • spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Jul 5 • 5 • 2
  • spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

    Viewer • Updated Jul 5 • 25.5k • 17
  • SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

    Paper • 2506.24119 • Published Jun 30 • 50