PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 11 days ago • 69
view article Article Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • Sep 24 • 8
Qianfan-VL Collection Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios. • 4 items • Updated Sep 24 • 19
view article Article Unleashing the Full Potential of ERNIE4.5 using FastDeploy By baidu and 3 others • Sep 19 • 11
view article Article PP-OCRv5 on Hugging Face: A Specialized Approach to OCR By baidu and 5 others • Sep 10 • 108
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15 • 48
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 26 items • Updated Sep 24 • 174