Haoyu Guo

ghy0324

Recent Activity

upvoted 2 papers 4 days ago

RL makes MLLMs see better than SFT

Paper • 2510.16333 • Published 9 days ago • 43

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 6 days ago • 55

upvoted a paper 11 days ago

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published 11 days ago • 24

upvoted a paper 12 days ago

A Survey of Vibe Coding with Large Language Models

Paper • 2510.12399 • Published 12 days ago • 45

upvoted 4 papers 14 days ago

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Paper • 2510.09606 • Published 16 days ago • 17

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published 19 days ago • 31

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 16 days ago • 48

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published 17 days ago • 117

upvoted a paper 17 days ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published 17 days ago • 66

upvoted 3 papers 18 days ago

Bridging Text and Video Generation: A Survey

Paper • 2510.04999 • Published 20 days ago • 3

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Paper • 2510.06917 • Published 18 days ago • 34

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published 19 days ago • 51

upvoted 4 papers 19 days ago

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published 25 days ago • 31

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published 20 days ago • 35

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published 20 days ago • 106

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published 20 days ago • 45

upvoted 4 papers 23 days ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published 27 days ago • 133

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published 25 days ago • 86

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published 24 days ago • 76

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published 24 days ago • 91