Voxtral Realtime 4B
Speech-to-Text in the browser with transformers.js + WebGPU
Speech-to-Text in the browser with transformers.js + WebGPU
Real-time speech transcription, entirely in your browser.
Generate realistic speech audio from text in chosen or custom voice
A cutting-edge speech generation model with stereo support
Controllable TTS via instruction prompting (JPN / Anime)
FireRed-Image-Edit ร Qwen-Image-Edit-Rapid (Transformers)
FireRed-OCR for Document Recognition
Generate speech audio from text with custom or cloned voices