HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 7 days ago • 174
Running 120 Qwen3.5 Omni Offline Demo 🌍 120 Chat with a multimodal AI using text, images, audio, or video
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 9 days ago • 589k • 2.64k
TADA Collection TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated 21 days ago • 70
view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 Jan 29 • 106
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published Jan 20 • 37
Running on Zero MCP Featured 2.17k Qwen Image Edit Camera Control 🎬 2.17k Fast 4 step inference with Qwen Image Edit 2509