This collection hosts a series of Vision Language Models (VLMs) fine-tuned for Optical Character Recognition (OCR) and Document Processing.
- loay/Arabic-OCR-Qwen2.5-VL-7B-Vision
  Image-to-Text • 8B • Updated • 102 • 3
- loay/Arabic-OCR-DeepSeek-OCR-2
  Image-to-Text • 3B • Updated • 82
- loay/English-Document-OCR-Qwen3.5-2B
  Image-Text-to-Text • 2B • Updated • 248 • 1
- loay/English-Document-OCR-Qwen3.5-0.8B
  Image-Text-to-Text • 0.8B • Updated • 773 • 8