ERNIE 4.5
Collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights.
baidu/ERNIE-4.5-21B-A3B-Thinking • Text Generation • 22B • Updated 3 days ago • 1.02k • 756
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle • Image-Text-to-Text • 424B • Updated Aug 19 • 26 • 60
baidu/ERNIE-4.5-VL-424B-A47B-Base-PT • Image-Text-to-Text • 424B • Updated Sep 1 • 101 • 69
baidu/ERNIE-4.5-300B-A47B-Base-Paddle • Text Generation • 299B • Updated Aug 20 • 61 • 17
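The suffix convention described above can be checked mechanically before choosing a loader. The following is a minimal sketch, not part of the collection itself: the function name `weight_format` and the returned labels are hypothetical, and the only rule it encodes is the "-Paddle" / "-PT" naming stated in the collection description. Repositories without either suffix are reported as "unspecified" rather than guessed.

```python
def weight_format(repo_id: str) -> str:
    """Classify a repository by the suffix convention from the collection note.

    Hypothetical helper: the labels returned here are illustrative only.
    """
    # Drop the organization prefix, e.g. "baidu/".
    name = repo_id.rsplit("/", 1)[-1]
    if name.endswith("-Paddle"):
        return "PaddlePaddle"
    if name.endswith("-PT"):
        return "PyTorch"
    # No suffix: the collection note does not say which format is used.
    return "unspecified"


if __name__ == "__main__":
    for repo in (
        "baidu/ERNIE-4.5-21B-A3B-Thinking",
        "baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle",
        "baidu/ERNIE-4.5-VL-424B-A47B-Base-PT",
        "baidu/ERNIE-4.5-300B-A47B-Base-Paddle",
    ):
        print(repo, "->", weight_format(repo))
```

The check uses only the repository name, so it needs no network access; it says nothing about whether a given repository actually exists or loads.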
Qianfan-VL
Qianfan-VL model series. The models are mainly domain-enhanced vision-language models targeting enterprise-level multimodal understanding scenarios.
baidu/Qianfan-VL-70B • Image-Text-to-Text • 72B • Updated Sep 19 • 161 • 33
baidu/Qianfan-VL-8B • Image-Text-to-Text • 9B • Updated Sep 19 • 704 • 33
baidu/Qianfan-VL-3B • Image-Text-to-Text • 4B • Updated Sep 19 • 405 • 27
Qianfan VL Demo • Running • 2 • Domain-Enhanced Universal Vision-Language Models