Qwen2.5-VL-32B-OCR

This is a LoRA adapter for Qwen2.5-VL-32B fine-tuned on Optical Character Recognition tasks.

Model Details

  • Base Model: Qwen/Qwen2.5-VL-32B
  • Task: Optical Character Recognition
  • Training Method: LoRA (Low-Rank Adaptation)
  • Framework: Transformers

Usage

from transformers import AutoModelForVision2Seq, AutoProcessor
from peft import PeftModel

# Load base model
base_model = AutoModelForVision2Seq.from_pretrained(
    "Qwen/Qwen2.5-VL-32B",
    trust_remote_code=True
)

# Load LoRA adapter
model = PeftModel.from_pretrained(base_model, "yongxianwei/Qwen2.5-VL-32B-OCR")
processor = AutoProcessor.from_pretrained("Qwen/Qwen2.5-VL-32B", trust_remote_code=True)

# Inference
# ... your inference code ...

Training Details

Fine-tuned using LoRA on specific Optical Character Recognition datasets.

Citation

@misc{qwen2.5-vl-optical character recognition,
  author = {Yongxian Wei},
  title = {Qwen2.5-VL LoRA for Optical Character Recognition},
  year = {2024},
  publisher = {Hugging Face},
  url = {https://huggingface.co/yongxianwei/Qwen2.5-VL-32B-OCR}
}
Downloads last month
11
Safetensors
Model size
33B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for yongxianwei/Qwen2.5-VL-32B-OCR

Adapters
1 model

Collection including yongxianwei/Qwen2.5-VL-32B-OCR