yl-1993

Recent Activity

upvoted a paper 7 days ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published 12 days ago • 65

upvoted a paper 28 days ago

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published 29 days ago • 35

upvoted a paper about 2 months ago

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Paper • 2508.21496 • Published Aug 29 • 54

upvoted 3 papers 2 months ago

EgoTwin: Dreaming Body and View in First Person

Paper • 2508.13013 • Published Aug 18 • 20

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published Aug 18 • 34

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published Aug 18 • 62

upvoted an article 6 months ago

FramePack LoRA Experiment

Article • Apr 19 • 22

upvoted a paper 8 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 45

upvoted 2 papers 11 months ago

SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters

Paper • 2412.00174 • Published Nov 29, 2024 • 23

Trajectory Attention for Fine-grained Video Motion Control

Paper • 2411.19324 • Published Nov 28, 2024 • 13