Mu Cai
mucai
AI & ML interests
Computer Vision, Deep Learning, 3D Vision, Vision and Language,
Recent Activity
upvoted a paper about 15 hours ago
MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents upvoted a paper 2 days ago
From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing upvoted a paper about 2 months ago
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models