arxiv:2412.18426
Tony Zhao PRO
tianchez
AI & ML interests
Multimodal Agent, Generative AI
Recent Activity
upvoted an article about 14 hours ago
VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation published an article about 15 hours ago
VLX-Seek: Improving VLM Fine-Grained Perception via Region Reference Instead of Coordinate Generation upvoted an article 1 day ago
VLX-Flow: Continuous Video Understanding for Real-Time Multimodal Interaction