12 17 44

XuHao Hu

Foreshhh

AI & ML interests

NLP MM

Recent Activity

upvoted a paper 8 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

upvoted a paper 23 days ago

Geometrically-Constrained Agent for Spatial Reasoning

upvoted a collection about 1 month ago

CapRL

View all activity

Organizations

upvoted a paper 8 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published 10 days ago • 98

upvoted a paper 23 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 27 days ago • 40

upvoted a collection about 1 month ago

CapRL

Collection

Data & Models for CapRL • 8 items • Updated Oct 22 • 6

upvoted 2 papers 2 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22 • 19

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Paper • 2510.08759 • Published Oct 9 • 46

upvoted 2 papers 3 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9 • 22

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 8

upvoted a collection 3 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.52k

upvoted a paper 3 months ago

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29

upvoted a paper 6 months ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10 • 37

upvoted a paper 7 months ago

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4 • 43

upvoted a paper 9 months ago

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21 • 61

upvoted 2 papers about 1 year ago

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published Dec 5, 2024 • 38

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10

upvoted 2 papers over 1 year ago

SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models

Paper • 2402.05044 • Published Feb 7, 2024 • 2

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 27

upvoted a paper almost 2 years ago

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Paper • 2402.03040 • Published Feb 5, 2024 • 19

XuHao Hu

AI & ML interests

Recent Activity

Organizations

Foreshhh's activity