Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Nan Wang's picture

Nan Wang

nan

bwang0911's profile picture

Relic-Yuexi's profile picture

21world's profile picture

·

nanwang_t
nan-wang

AI & ML interests

None yet

Organizations

nan 's collections 6

document understanding

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16, 2025 • 4.4k • 515

Multimodal LLM Datasets

A collection of the multimodal LLM datasets

haonan3/V1-33K-Old

Viewer • Updated Mar 22, 2025 • 31.8k • 1.02k • 4

Multimodal Evaluation

MMInstruction/ArxivQA

Viewer • Updated Mar 5, 2024 • 100k • 225 • 38
lmms-lab/DocVQA

Viewer • Updated Apr 18, 2024 • 16.6k • 29.7k • 84
vidore/shiftproject_test_captioning

Viewer • Updated Jun 20, 2025 • 2.05k • 19
vidore/syntheticDocQA_government_reports_test

Viewer • Updated Jun 20, 2025 • 1k • 1.67k • 1

ml-question-corpus

joey234/mmlu-machine_learning-neg-prepend

Viewer • Updated Aug 23, 2023 • 117 • 14 • 1
joey234/mmlu-machine_learning-verbal-neg-prepend

Viewer • Updated Apr 27, 2023 • 112 • 11 • 1
win-wang/Machine_Learning_QA_Collection

Viewer • Updated Sep 25, 2024 • 12.4k • 31 • 8
efeno/colpali_training_machine_learning

Viewer • Updated Aug 16, 2024 • 723 • 9

Document Embeddings

openbmb/VisRAG-Ret

Feature Extraction • 3B • Updated Nov 4, 2024 • 1.08k • 73
vidore/colpali

Visual Document Retrieval • Updated Nov 24, 2025 • 7.57k • 482

Document Embedding Datasets & Models

bevaya/ScreenSpot

Viewer • Updated Apr 10, 2024 • 1.27k • 3.11k • 50
osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 6.07k • 96
cjfcsjt/AITW_General

Viewer • Updated May 4, 2024 • 100k • 692 • 2
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 268 • 1.71k

document understanding

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16, 2025 • 4.4k • 515

ml-question-corpus

joey234/mmlu-machine_learning-neg-prepend

Viewer • Updated Aug 23, 2023 • 117 • 14 • 1
joey234/mmlu-machine_learning-verbal-neg-prepend

Viewer • Updated Apr 27, 2023 • 112 • 11 • 1
win-wang/Machine_Learning_QA_Collection

Viewer • Updated Sep 25, 2024 • 12.4k • 31 • 8
efeno/colpali_training_machine_learning

Viewer • Updated Aug 16, 2024 • 723 • 9

Multimodal LLM Datasets

A collection of the multimodal LLM datasets

haonan3/V1-33K-Old

Viewer • Updated Mar 22, 2025 • 31.8k • 1.02k • 4

Document Embeddings

openbmb/VisRAG-Ret

Feature Extraction • 3B • Updated Nov 4, 2024 • 1.08k • 73
vidore/colpali

Visual Document Retrieval • Updated Nov 24, 2025 • 7.57k • 482

Multimodal Evaluation

MMInstruction/ArxivQA

Viewer • Updated Mar 5, 2024 • 100k • 225 • 38
lmms-lab/DocVQA

Viewer • Updated Apr 18, 2024 • 16.6k • 29.7k • 84
vidore/shiftproject_test_captioning

Viewer • Updated Jun 20, 2025 • 2.05k • 19
vidore/syntheticDocQA_government_reports_test

Viewer • Updated Jun 20, 2025 • 1k • 1.67k • 1

Document Embedding Datasets & Models

bevaya/ScreenSpot

Viewer • Updated Apr 10, 2024 • 1.27k • 3.11k • 50
osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 6.07k • 96
cjfcsjt/AITW_General

Viewer • Updated May 4, 2024 • 100k • 692 • 2
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 268 • 1.71k

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs