arxiv:2412.04468
Yuxian Gu
t1101675
AI & ML interests
Efficient methods for language models
Recent Activity
upvoted a paper 6 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
updated a Space about 1 month ago
t1101675/trackio
published a Space about 1 month ago
t1101675/trackio