6 33 26

Zuhao Yang

mwxely

https://mwxely.github.io/

AI & ML interests

Large Multimodal Models

Recent Activity

authored a paper about 9 hours ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

authored a paper about 9 hours ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

upvoted a paper about 18 hours ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

View all activity

Organizations

authored 2 papers about 9 hours ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 186

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published 3 days ago

upvoted a paper about 18 hours ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 3 days ago • 117

updated a dataset about 23 hours ago

ParaVT/ParaVT-Source

Updated about 18 hours ago • 592 • 2

updated a model about 23 hours ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated about 18 hours ago • 65 • 3

updated a dataset about 23 hours ago

ParaVT/ParaVT-Parquet

Viewer • Updated about 18 hours ago • 101k • 87 • 2

liked a model 4 days ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated about 18 hours ago • 65 • 3

liked 2 datasets 4 days ago

ParaVT/ParaVT-Source

Updated about 18 hours ago • 592 • 2

ParaVT/ParaVT-Parquet

Viewer • Updated about 18 hours ago • 101k • 87 • 2

published 2 datasets 4 days ago

ParaVT/ParaVT-Source

Updated about 18 hours ago • 592 • 2

ParaVT/ParaVT-Parquet

Viewer • Updated about 18 hours ago • 101k • 87 • 2

published a model 4 days ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated about 18 hours ago • 65 • 3

authored a paper 9 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 21 days ago • 48

authored a paper 10 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published 11 days ago • 30

upvoted a paper 10 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published 11 days ago • 30

upvoted a paper 16 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 21 days ago • 48

authored a paper 18 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 22 days ago • 90

commented 2 papers 21 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 22 days ago • 90 •

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 22 days ago • 90 •

upvoted a paper 21 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 22 days ago • 90

Zuhao Yang

AI & ML interests

Recent Activity

Organizations

mwxely's activity