arxiv:2512.01374
An Yang
yangapku
AI & ML interests
NLP and Deep Learning
Recent Activity
authored a paper 7 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted a paper 7 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
authored a paper 13 days ago
Qwen-Image Technical Report