Rui's picture

1 1

Rui

Yalimu

·

AI & ML interests

None yet

Recent Activity

commented on a paper 14 days ago

One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient

upvoted a paper 21 days ago

One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient

commented on a paper 22 days ago

One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient

View all activity

Organizations

Collections 2

Papers 4

arxiv:2509.26313

arxiv:2505.12723

arxiv:2505.12717

arxiv:2502.12502

models 0

None public yet

datasets 0

None public yet