
Minimax Speech 2.8 Turbo


Fast text-to-speech with the same 17 voices, emotions, and 40+ languages as HD, optimized for speed and cost. Built for real-time apps, assistants, and interactive content.

Speed-optimized text-to-speech that delivers natural, expressive audio at lower cost and latency. It offers the same 17 voice presets, 10 emotion modes, and 40+ languages as HD, with full control over speed, pitch, volume, and pause timing. All 19 interjection tags are supported for lifelike delivery, including laughs, sighs, gasps, and breaths. Sub-250 ms latency suits real-time voice assistants, interactive games, and live applications. Output is stereo or mono at up to 44.1 kHz. Each generation is 40% cheaper than HD while maintaining high quality across all supported languages and voices.
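
The controls listed above (voice, emotion, speed, pitch, volume, and output format) could be collected into a request payload along the following lines. This is only a minimal sketch: the `TTSRequest` class, the `build_tts_request` helper, every field name, and the default values are hypothetical and are not taken from the MiniMax or Scenario API. The only limits it encodes from the description are mono or stereo output and a sample rate of at most 44.1 kHz.

```python
import json
from dataclasses import dataclass, asdict

MAX_SAMPLE_RATE_HZ = 44_100  # "up to 44.1 kHz" per the model description


@dataclass
class TTSRequest:
    # All field names and defaults are illustrative assumptions,
    # not the documented API schema.
    text: str
    voice: str = "example-voice"  # hypothetical preset name
    emotion: str = "neutral"      # hypothetical emotion label
    speed: float = 1.0            # assumed multiplier, 1.0 = normal
    pitch: int = 0                # assumed offset, 0 = unchanged
    volume: float = 1.0           # assumed multiplier, 1.0 = normal
    sample_rate_hz: int = MAX_SAMPLE_RATE_HZ
    channels: int = 1             # 1 = mono, 2 = stereo

    def validate(self) -> None:
        """Reject values that contradict the model description."""
        if not self.text.strip():
            raise ValueError("text must not be empty")
        if self.channels not in (1, 2):
            raise ValueError("channels must be 1 (mono) or 2 (stereo)")
        if not 0 < self.sample_rate_hz <= MAX_SAMPLE_RATE_HZ:
            raise ValueError("sample rate must be between 1 Hz and 44.1 kHz")
        if self.speed <= 0 or self.volume <= 0:
            raise ValueError("speed and volume must be positive")


def build_tts_request(text: str, **options) -> str:
    """Validate the options and serialize them to a JSON string."""
    request = TTSRequest(text=text, **options)
    request.validate()
    return json.dumps(asdict(request), ensure_ascii=False)


if __name__ == "__main__":
    print(build_tts_request("Hello there!", channels=2, speed=1.2))
```

Validating on the client before sending keeps obviously invalid requests (for example a 48 kHz sample rate) from costing a round trip; the actual parameter names and accepted ranges should be checked against the provider's API reference.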
