Multi-modal Multilingual Instruction

university

https://m3-it.github.io

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

tobiaslee authored a paper about 2 months ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

yuweiyin authored a paper 2 months ago

IntentGrasp: A Comprehensive Benchmark for Intent Understanding

yuweiyin submitted a paper 2 months ago

IntentGrasp: A Comprehensive Benchmark for Intent Understanding

View all activity

Collections 1

spaces 1

VL RewardBench

Explore vision-language model performance on VL-RewardBench

models 9

MMInstruction/Qwen2-VL-72B-Video-T3

73B • Updated Dec 23, 2024

MMInstruction/Giraffe

8B • Updated Dec 17, 2024 • 2

MMInstruction/LongVA-7B-Video-T3

8B • Updated Oct 26, 2024 • 3

MMInstruction/Qwen-VL-ArXivCap

Text Generation • Updated May 6, 2024 • 55 • 4

MMInstruction/Qwen-VL-ArXivQA

Text Generation • Updated May 6, 2024 • 86 • 4

MMInstruction/Silkie

Text Generation • Updated Dec 20, 2023 • 53 • 12

MMInstruction/YingVLM

Updated Aug 16, 2023 • 2 • 1

MMInstruction/YingVLM-zh

Updated Aug 10, 2023 • 2

MMInstruction/YingVLM-Video

Updated Aug 10, 2023 • 1

datasets 17

MMInstruction/stock_factors

Viewer • Updated Dec 8, 2025 • 48.2M • 238 • 3

MMInstruction/OSWorld-G

Viewer • Updated May 22, 2025 • 510 • 173 • 6

MMInstruction/VL-RewardBench

Viewer • Updated May 19, 2025 • 1.25k • 462 • 15

MMInstruction/Video-T3-QA

Viewer • Updated Feb 24, 2025 • 162k • 60 • 2

MMInstruction/SuperClevr_Val

Viewer • Updated Feb 18, 2025 • 5k • 7 • 1

MMInstruction/Clevr_CoGenT_TrainA_R1

Viewer • Updated Feb 13, 2025 • 37.8k • 147 • 48

MMInstruction/Clevr_CoGenT_TrainA_70K_Complex

Viewer • Updated Feb 5, 2025 • 70k • 1.96k • 8

MMInstruction/Clevr_CoGenT_ValB

Viewer • Updated Feb 3, 2025 • 5k • 38 • 2

MMInstruction/Clevr_CoGenT_ValA

Viewer • Updated Feb 3, 2025 • 5k • 629 • 1

MMInstruction/Clevr_CoAgent_TrainA_R1

Viewer • Updated Feb 2, 2025 • 2.5k • 123 • 1

View 17 datasets