VibeVoice Collection • Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244
faster-qwen3-tts 🎙 • Starting on A10G • Featured • 241 • Generate natural speech from text or voice samples
Parakeet-TDT-0.6b-V2 • Running on Zero • Agents • Featured • 472 • Transcribe audio files with timestamps and downloadable subtitles
The Smol Training Playbook 📚 • Running on CPU Upgrade • Featured • 3.19k • The secrets to building world-class LLMs
Parakeet STT Progressive Transcription 🎤 • Running • Featured • 93 • Transcribe speech to text instantly with WebGPU acceleration
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 8.05M • 3.04k
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations Paper • 2108.01073 • Published Aug 2, 2021 • 9
Qwen3-ASR Demo 🎙 • Running on Zero • Agents • Featured • 139 • Transcribe audio to text with timestamps and visualization
Omni Image Editor 🖼 • Running on CPU Upgrade • Agents • 1.79k • Image edit, text to image, image upscale, remove watermark
Qwen3-TTS Demo 🎙 • Running on Zero • Agents • Featured • 1.94k • Generate custom speech from text, voice descriptions, or samples