Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Rui's picture
1 1

Rui

Yalimu
·

AI & ML interests

None yet

Recent Activity

commented on a paper 14 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
upvoted a paper 21 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
commented on a paper 22 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
View all activity

Organizations

The Chinese University of Hong Kong's profile picture

Collections 2

Reasoning
  • open-r1/OpenR1-Math-220k

    Viewer • Updated Feb 18 • 450k • 7.68k • 656
Project Ⅰ
CMSC 5720
  • nvidia/ChatQA2-Long-SFT-data

    Viewer • Updated Sep 9, 2024 • 117k • 264 • 32
  • zai-org/LongCite-45k

    Viewer • Updated Oct 18, 2024 • 29.9k • 117 • 69
  • TIGER-Lab/LongRAG

    Viewer • Updated Jun 26, 2024 • 9.59M • 856 • 17
  • neural-bridge/rag-dataset-12000

    Viewer • Updated Feb 5, 2024 • 12k • 2k • 145
Reasoning
  • open-r1/OpenR1-Math-220k

    Viewer • Updated Feb 18 • 450k • 7.68k • 656
Project Ⅰ
CMSC 5720
  • nvidia/ChatQA2-Long-SFT-data

    Viewer • Updated Sep 9, 2024 • 117k • 264 • 32
  • zai-org/LongCite-45k

    Viewer • Updated Oct 18, 2024 • 29.9k • 117 • 69
  • TIGER-Lab/LongRAG

    Viewer • Updated Jun 26, 2024 • 9.59M • 856 • 17
  • neural-bridge/rag-dataset-12000

    Viewer • Updated Feb 5, 2024 • 12k • 2k • 145

Papers 4

arxiv:2509.26313
arxiv:2505.12723
arxiv:2505.12717
arxiv:2502.12502

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs