Hao Shao's picture

2 10 4

Hao Shao

deepcs233

·

https://hao-shao.com/

deepcs233

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models

updated a model 2 months ago

deepcs233/VividFace

published a model 2 months ago

deepcs233/VividFace

View all activity

Organizations

authored 5 papers 12 months ago

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Paper • 2312.07488 • Published Dec 12, 2023

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Paper • 2403.16999 • Published Mar 25, 2024 • 5

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Paper • 2404.13046 • Published Apr 19, 2024 • 1

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published Dec 15, 2024 • 13

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

authored a paper almost 2 years ago

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Paper • 2402.05935 • Published Feb 8, 2024 • 17