Update README.md
README.md
CHANGED
@@ -4,48 +4,9 @@ emoji: 🐵
 colorFrom: blue
 colorTo: green
 sdk: gradio
-sdk_version: 5.
+sdk_version: 5.23.3
 app_file: app.py
 pinned: false
 license: apache-2.0
-python_version: 3.
----
-
-# MonkeyOCR Document Parser
-
-MonkeyOCR is a lightweight multimodal document parsing model built on the Structure-Recognition-Relation (SRR) triplet paradigm.
-
-## Features
-
-- 🔍 **High-accuracy recognition**: parses Chinese and English documents
-- 📊 **Table extraction**: detects and extracts table data
-- 🧮 **Formula parsing**: accurately recognizes mathematical formulas
-- 📝 **Structured output**: returns results in Markdown format
-- ⚡ **Efficient processing**: 0.84 pages per second
-
-## Usage
-
-1. Upload a PDF document or image file
-2. Enter a parsing prompt (optional)
-3. Click the "Start Parsing" button
-4. View the parsed result in Markdown format
-
-## Model Information
-
-- **Parameters**: 3B
-- **Supported languages**: Chinese, English
-- **Supported formats**: PDF, PNG, JPG, JPEG
-- **Base model**: based on Qwen2.5-VL
-
-## Citation
-
-```bibtex
-@misc{li2025monkeyocrdocumentparsingstructurerecognitionrelation,
-title={MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm},
-author={Zhang Li and Yuliang Liu and Qiang Liu and Zhiyin Ma and Ziyang Zhang and Shuo Zhang and Zidun Guo and Jiarui Zhang and Xinyu Wang and Xiang Bai},
-year={2025},
-eprint={2506.05218},
-archivePrefix={arXiv},
-primaryClass={cs.CV},
-url={https://arxiv.org/abs/2506.05218},
-}
+python_version: "3.10"
+---
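
The added `python_version: "3.10"` line is quoted. One likely reason is that common YAML loaders resolve an unquoted `3.10` as a floating-point number, which drops the trailing zero. The sketch below is only an illustration of that effect in plain Python. It does not parse YAML, the helper name `as_version_string` is hypothetical, and the commit does not say whether this was the cause of the Space's runtime error.

```python
# Hypothetical illustration (not taken from the commit): what happens to a
# version value once it has been read as a number instead of a string.

# An unquoted YAML scalar such as `python_version: 3.10` is typically
# resolved as a float, which is simulated here with float().
unquoted_value = float("3.10")

# A quoted scalar such as `python_version: "3.10"` stays a string.
quoted_value = "3.10"


def as_version_string(value):
    """Render a front-matter value the way a consumer might, via str()."""
    return str(value)


print(as_version_string(unquoted_value))  # 3.1
print(as_version_string(quoted_value))    # 3.10
```

Quoting the value keeps the minor version intact regardless of how the loader resolves numeric scalars.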