TTS — Streaming
Streams PCM from the model and plays it as it arrives. Latency = time to first audio.
Speaker
Emotion
Language
English
Hindi
Text
Romanize output to Hinglish (print the transliterated text)
▶ Generate
Latency
—
Duration
—
RTF
—
Status
—
▶
0:00
/
0:00
Output text
Speakers and emotions are configured in this file. Audio is 16-bit mono PCM @ 24 kHz.