document understanding ByteDance/Dolphin Image-Text-to-Text • 0.4B • Updated Jul 16, 2025 • 4.4k • 515
Multimodal LLM Datasets A collection of the multimodal LLM datasets haonan3/V1-33K-Old Viewer • Updated Mar 22, 2025 • 31.8k • 1.02k • 4
Multimodal Evaluation MMInstruction/ArxivQA Viewer • Updated Mar 5, 2024 • 100k • 225 • 38 lmms-lab/DocVQA Viewer • Updated Apr 18, 2024 • 16.6k • 29.7k • 84 vidore/shiftproject_test_captioning Viewer • Updated Jun 20, 2025 • 2.05k • 19 vidore/syntheticDocQA_government_reports_test Viewer • Updated Jun 20, 2025 • 1k • 1.67k • 1
ml-question-corpus joey234/mmlu-machine_learning-neg-prepend Viewer • Updated Aug 23, 2023 • 117 • 14 • 1 joey234/mmlu-machine_learning-verbal-neg-prepend Viewer • Updated Apr 27, 2023 • 112 • 11 • 1 win-wang/Machine_Learning_QA_Collection Viewer • Updated Sep 25, 2024 • 12.4k • 31 • 8 efeno/colpali_training_machine_learning Viewer • Updated Aug 16, 2024 • 723 • 9
Document Embeddings openbmb/VisRAG-Ret Feature Extraction • 3B • Updated Nov 4, 2024 • 1.08k • 73 vidore/colpali Visual Document Retrieval • Updated Nov 24, 2025 • 7.57k • 482
Document Embedding Datasets & Models bevaya/ScreenSpot Viewer • Updated Apr 10, 2024 • 1.27k • 3.11k • 50 osunlp/Multimodal-Mind2Web Viewer • Updated Jun 5, 2024 • 14.2k • 6.07k • 96 cjfcsjt/AITW_General Viewer • Updated May 4, 2024 • 100k • 692 • 2 microsoft/OmniParser Image-Text-to-Text • Updated Dec 2, 2024 • 268 • 1.71k
document understanding ByteDance/Dolphin Image-Text-to-Text • 0.4B • Updated Jul 16, 2025 • 4.4k • 515
ml-question-corpus joey234/mmlu-machine_learning-neg-prepend Viewer • Updated Aug 23, 2023 • 117 • 14 • 1 joey234/mmlu-machine_learning-verbal-neg-prepend Viewer • Updated Apr 27, 2023 • 112 • 11 • 1 win-wang/Machine_Learning_QA_Collection Viewer • Updated Sep 25, 2024 • 12.4k • 31 • 8 efeno/colpali_training_machine_learning Viewer • Updated Aug 16, 2024 • 723 • 9
Multimodal LLM Datasets A collection of the multimodal LLM datasets haonan3/V1-33K-Old Viewer • Updated Mar 22, 2025 • 31.8k • 1.02k • 4
Document Embeddings openbmb/VisRAG-Ret Feature Extraction • 3B • Updated Nov 4, 2024 • 1.08k • 73 vidore/colpali Visual Document Retrieval • Updated Nov 24, 2025 • 7.57k • 482
Multimodal Evaluation MMInstruction/ArxivQA Viewer • Updated Mar 5, 2024 • 100k • 225 • 38 lmms-lab/DocVQA Viewer • Updated Apr 18, 2024 • 16.6k • 29.7k • 84 vidore/shiftproject_test_captioning Viewer • Updated Jun 20, 2025 • 2.05k • 19 vidore/syntheticDocQA_government_reports_test Viewer • Updated Jun 20, 2025 • 1k • 1.67k • 1
Document Embedding Datasets & Models bevaya/ScreenSpot Viewer • Updated Apr 10, 2024 • 1.27k • 3.11k • 50 osunlp/Multimodal-Mind2Web Viewer • Updated Jun 5, 2024 • 14.2k • 6.07k • 96 cjfcsjt/AITW_General Viewer • Updated May 4, 2024 • 100k • 692 • 2 microsoft/OmniParser Image-Text-to-Text • Updated Dec 2, 2024 • 268 • 1.71k