Zhendong Chu
Wesley123
AI & ML interests
Natural Language Processing, Recommender Systems
Recent Activity
upvoted a paper 26 days ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
upvoted a paper 5 months ago
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
Organizations
None yet