Weiming Ren
wren93
AI & ML interests
Multimodal Understanding, Generative Modelling
Recent Activity
authored a paper 28 days ago
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
upvoted a paper about 1 month ago
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling