- DocLLM: A layout-aware generative language model for multimodal document understanding
  Paper • 2401.00908 • Published • 189
- CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
  Paper • 2310.02960 • Published • 1
- microsoft/phi-2
  Text Generation • 3B • Updated • 711k • 3.41k
Johnny No. 5
johnny-numero-5
AI & ML interests
Graph RAG & Graph Neural Networks
Recent Activity
liked a model about 1 month ago
ibm-granite/granite-docling-258M
liked a model about 1 month ago
docling-project/docling-models
liked a model 12 months ago
vidore/colpali-v1.2
Organizations
None yet