OneVL series vision-language models
Xiaomi Research
community
AI & ML interests
None defined yet.
Recent Activity
Papers
Discrete-WAM: Unified Discrete Vision-Action Token Editing for World-Policy Learning
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
models 21
xiaomi-research/OneVL_mlp_NAVSIM
14B • Updated • 74
xiaomi-research/Baseline_cot_NAVSIM
570k • Updated • 15
xiaomi-research/Baseline_answer_NAVSIM
570k • Updated • 16
xiaomi-research/OneVL_visual_decoder_pt_ar1
Image-Text-to-Text • 5B • Updated • 34
xiaomi-research/OneVL_visual_decoder_pt
Image-Text-to-Text • 5B • Updated • 70
xiaomi-research/OneVL_ROADWork
Image-Text-to-Text • 14B • Updated • 45
xiaomi-research/OneVL_NAVSIM
Image-Text-to-Text • 14B • Updated • 306
xiaomi-research/OneVL_Impromptu
Image-Text-to-Text • 14B • Updated • 31
xiaomi-research/OneVL_AlpamayoR1
Image-Text-to-Text • 14B • Updated • 73
xiaomi-research/TTS-PRISM-7B
Audio Classification • 8B • Updated • 31
datasets 0
None public yet