IndexTTS 2 Demo
Generate expressive speech audio from text with emotion control
Run a web-based user interface
Generate speech from text using a reference audio
Generate speech from text using a reference voice
Expressive zero-shot TTS
An Agentic Framework with Tools for Complex Reasoning
Conversational speech generation
Restore and enhance faces in photos
Audio-based video editing using AI-generated transcription