Zhang Xingjian

Zhang199

ZhangXJ199

AI & ML interests

Large Multimodal Models

Organizations

None yet

Zhang199's collections (4)

Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity

Zhang199/EDGE-GRPO-Qwen-7B

Text Generation • 8B • Updated Jul 30 • 9
Zhang199/EDGE-GRPO-Qwen-1.5B

Text Generation • 2B • Updated Jul 30 • 5
EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity

Paper • 2507.21848 • Published Jul 29 • 8

TinyLLaVA-Video

A Simple Framework of Small-scale LMMs for Video Understanding.

Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512

Video-Text-to-Text • 4B • Updated Jun 12 • 81 • 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512

Video-Text-to-Text • 4B • Updated Apr 24 • 187
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Naive-16-512

Video-Text-to-Text • 4B • Updated Jun 12 • 17
Zhang199/TinyLLaVA-Video-Phi2-Naive-16-512

Video-Text-to-Text • 3B • Updated Jun 12 • 24

TinyLLaVA-Video-R1

Towards Smaller LMMs for Video Reasoning.

Zhang199/TinyLLaVA-Video-R1

Video-Text-to-Text • 4B • Updated Apr 24 • 20 • 4
Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16

Video-Text-to-Text • 4B • Updated Apr 24 • 16 • 1
Zhang199/TinyLLaVA-Video-R1-training-data

Updated Apr 24 • 32 • 1
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning

Paper • 2504.09641 • Published Apr 13 • 16

A Framework of Small-scale Large Multimodal Models.

Zhang199/TinyLLaVA-Qwen2.5-3B-SigLIP

Image-Text-to-Text • 4B • Updated May 29 • 19
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIP

Image-Text-to-Text • 1B • Updated Aug 10 • 1.92k • 5
TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

Paper • 2405.11788 • Published May 20, 2024
