Wei Liu

lefutonku

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

upvoted a paper 5 days ago

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

liked a model 6 days ago

stable-diffusion-v1-5/stable-diffusion-v1-5

Organizations

None yet

upvoted 2 papers 5 days ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published Apr 20 • 94

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published Sep 29, 2025 • 53

upvoted 4 papers 8 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 12 days ago • 185

upvoted 4 papers 11 days ago

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 203

Pixal3D: Pixel-Aligned 3D Generation from Images

Paper • 2605.10922 • Published 13 days ago • 32

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published 17 days ago • 51

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 16 days ago • 97

upvoted 2 papers 14 days ago

DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published Dec 8, 2025 • 35

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Paper • 2605.02396 • Published 20 days ago • 23

upvoted an article 17 days ago

The Annotated Diffusion Model

Article • nielsr, kashif • Jun 7, 2022 • 358

upvoted 6 papers 24 days ago

SkVM: Compiling Skills for Efficient Execution Everywhere

Paper • 2604.03088 • Published Apr 6 • 10

GigaWorld-Policy: An Efficient Action-Centered World-Action Model

Paper • 2603.17240 • Published Mar 18 • 26

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 27 days ago • 118

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published 27 days ago • 71

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published 27 days ago • 67

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 30 days ago • 226

upvoted a paper 28 days ago

Vista4D: Video Reshooting with 4D Point Clouds

Paper • 2604.21915 • Published about 1 month ago • 12
