Mono-InternVL - a OpenGVLab Collection

OpenGVLab 's Collections

InternVideo-Next

Vlaser

NaViL

InternVL3.5-Flash

InternVL3.5-Core

SDLM

ZeroGUI

PIIP

VideoChat-Flash

InternVL2.5-MPO

V2PE

InternVL Adaptation

All-Seeing Project

PVT v2

Mono-InternVL

updated Sep 28

A Pioneering Monolithic MLLM

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 4

Note CVPR 2025
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

Paper • 2507.12566 • Published Jul 16 • 14
OpenGVLab/Mono-InternVL-2B

Image-Text-to-Text • 3B • Updated Jul 22 • 8.68k • 36
OpenGVLab/Mono-InternVL-2B-S1-1

Image-Text-to-Text • 3B • Updated Jul 22 • 70
OpenGVLab/Mono-InternVL-2B-S1-2

Image-Text-to-Text • 3B • Updated Jul 22 • 74 • 1
OpenGVLab/Mono-InternVL-2B-S1-3

Image-Text-to-Text • 3B • Updated Jul 22 • 79 • 1
OpenGVLab/Mono-InternVL-2B-Synthetic-Data

Viewer • Updated Jul 22 • 3.05k • 55 • 2