Wujian Peng(SII)

wjpoom

https://scholar.google.com/citations?user=GTuWk9YAAAAJ&hl=zh-CN

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published 12 days ago • 42

upvoted a paper about 2 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

updated a model 4 months ago

wjpoom/SPEC-CLIP-ViT-B-32

Updated Jun 16 • 1

published a model 4 months ago

wjpoom/SPEC-CLIP-ViT-B-32

Updated Jun 16 • 1

upvoted 2 papers 5 months ago

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Paper • 2505.18600 • Published May 24 • 48

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18 • 24

upvoted a paper 6 months ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 93

authored a paper 7 months ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published Mar 24 • 30

upvoted a paper 7 months ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published Mar 24 • 30

upvoted 2 papers 8 months ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13 • 55

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 123

updated 2 datasets 8 months ago

Inst-IT/Inst-It-Bench

Viewer • Updated Mar 3 • 4.07k • 230 • 1

Inst-IT/Inst-It-Dataset

Viewer • Updated Mar 1 • 72.5k • 310 • 10

updated a Space 8 months ago

README

🐨

Boosting Multimodal Understanding at Instance-Level

published a Space 8 months ago

README

🐨

Boosting Multimodal Understanding at Instance-Level

updated a collection 8 months ago

Inst-IT Models

Collection

A series of LMMs finetuned with the Inst-IT Dataset, skilled in fine-grained image/video understanding at the instance-level. • 2 items • Updated Mar 17

updated a model 8 months ago

Inst-IT/LLaVA-Next-Inst-It-Qwen2-7B

Video-Text-to-Text • 8B • Updated Feb 21 • 7 • 3

liked a dataset 8 months ago

Inst-IT/Inst-It-Bench

Viewer • Updated Mar 3 • 4.07k • 230 • 1

updated a model 8 months ago

Inst-IT/LLaVA-Next-Inst-It-Vicuna-7B

Video-Text-to-Text • 7B • Updated Feb 20 • 16 • 2
