Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Pu Fanyi's picture
6 56 124

Pu Fanyi

pufanyi
pbcong's profile picture dark-pen's profile picture Delcos's profile picture
·
https://pufanyi.github.io
  • pufanyi
  • pufanyi

AI & ML interests

CV

Recent Activity

liked a model 2 days ago
facebook/dinov2-large
liked a model 2 days ago
Qwen/Qwen3-VL-2B-Instruct
upvoted a paper 3 days ago
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
View all activity

Organizations

Nanyang Technological University's profile picture SenseNova's profile picture LMMs-Lab's profile picture LongVa's profile picture Evolve-lmms-lab's profile picture

authored a paper about 1 month ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published Nov 17 • 45
authored a paper 11 months ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 23
authored a paper over 1 year ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 35
authored a paper about 2 years ago

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 34
authored a paper over 2 years ago

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Paper • 2306.05425 • Published Jun 8, 2023 • 11
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs