Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
- PaddlePaddle/PaddleOCR-VL-1.5
  Image-Text-to-Text • 1.0B • Updated • 44.2k • 614
- PaddleOCR-VL-1.5 Online Demo
  73 • PaddleOCR-VL-1.5_Online_Demo
- PaddlePaddle/PP-DocLayoutV3
  Image Segmentation • Updated • 27.2k • 74
- PaddlePaddle/PP-DocLayoutV3_safetensors
  Object Detection • Updated • 313k • 24