arxiv:2604.24763
Weiming Ren
wren93
AI & ML interests
Multimodal Understanding, Generative Modelling
Recent Activity
authored a paper 27 days ago
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
upvoted a paper about 1 month ago
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling