Rui Yalimu
AI & ML interests
None yet
Recent Activity
commented on a paper 14 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
upvoted a paper 21 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient
commented on a paper 22 days ago
One-Token Rollout: Guiding Supervised Fine-Tuning of LLMs with Policy Gradient