huzican's Collections
DIVER

updated 24 days ago

Diversity-Incentivized Exploration for Versatile Reasoning

  • Diversity-Incentivized Exploration for Versatile Reasoning

    Paper • 2509.26209 • Published Sep 30 • 16

  • huzican/DIVER-TD-Qwen2.5-Math-7B

    7B • Updated 24 days ago • 15

  • huzican/DIVER-ED-Qwen2.5-Math-7B

    7B • Updated 24 days ago

  • huzican/Baseline-Entropy-RL-Qwen2.5-Math-7B

    7B • Updated 24 days ago • 13

  • huzican/Baseline-Passk-Training-Qwen2.5-Math-7B

    7B • Updated 24 days ago • 10

  • huzican/Baseline-Clip-Higher-Qwen2.5-Math-7B

    7B • Updated 24 days ago • 12

  • huzican/DIVER-Training-Openr1-Math-46k

    Viewer • Updated 24 days ago • 45.8k • 21

  • huzican/Qwen2.5-Math-7B-16k-think

    7B • Updated 24 days ago • 14

  • huzican/DIVER-Test

    Viewer • Updated 24 days ago • 6.02k • 14