Hongbo Peng
M1chaelPeng
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 1 month ago
stepfun-ai/StepFun-Formalizer-Training
upvoted
a
paper
about 2 months ago
Random Policy Valuation is Enough for LLM Reasoning with Verifiable
Rewards
upvoted
a
paper
3 months ago
Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding
Organizations
None yet