stablegravity's Collections: checkitoutlater
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper
• 2412.05355
• Published • 8
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
Paper
• 2412.04301
• Published • 40
PanoDreamer: 3D Panorama Synthesis from a Single Image
Paper
• 2412.04827
• Published • 10
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Paper
• 2412.06781
• Published • 23
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
Paper
• 2412.19712
• Published • 15
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents
Paper
• 2502.05957
• Published • 15
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper
• 2502.06329
• Published • 133
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper
• 2502.14786
• Published • 161
X-Dancer: Expressive Music to Human Dance Video Generation
Paper
• 2502.17414
• Published • 14
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
Paper
• 2503.05978
• Published • 36
Motion Anything: Any to Motion Generation
Paper
• 2503.06955
• Published • 35
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Paper
• 2503.16418
• Published • 36
Paper
• 2503.14378
• Published • 61
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis
Paper
• 2503.21749
• Published • 26
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Paper
• 2503.23461
• Published • 94
MoCha: Towards Movie-Grade Talking Character Synthesis
Paper
• 2503.23307
• Published • 141
UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes
Paper
• 2505.23253
• Published • 4
How Animals Dance (When You're Not Looking)
Paper
• 2505.23738
• Published • 3
Sherlock: Self-Correcting Reasoning in Vision-Language Models
Paper
• 2505.22651
• Published • 48
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Paper
• 2505.21497
• Published • 109
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Paper
• 2505.18445
• Published • 63
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper
• 2512.08269
• Published • 122
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
• 2512.11253
• Published • 39
OmniPSD: Layered PSD Generation with Diffusion Transformer
Paper
• 2512.09247
• Published • 49
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
Paper
• 2601.05432
• Published • 169
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
Paper
• 2512.23576
• Published • 66