see later S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 6 days ago • 39 paom/texture2albedo-v2 Image-to-Image • Updated 2 days ago • 389 • 30 SakanaAI/DreamCubedDP4 Updated Apr 28 • 13
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 6 days ago • 39
future PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46 minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 27 days ago • 59 From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 28 days ago • 74 open-thoughts/AgentTrove Viewer • Updated May 7 • 1.7M • 3.32k • 187
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 27 days ago • 59
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 28 days ago • 74
see later S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 6 days ago • 39 paom/texture2albedo-v2 Image-to-Image • Updated 2 days ago • 389 • 30 SakanaAI/DreamCubedDP4 Updated Apr 28 • 13
S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence Paper • 2606.20515 • Published 6 days ago • 39
future PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46 minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 27 days ago • 59 From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 28 days ago • 74 open-thoughts/AgentTrove Viewer • Updated May 7 • 1.7M • 3.32k • 187
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 27 days ago • 59
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 28 days ago • 74