Audio

Seed Audio 1.0

Generate expressive speech with a prompt-defined voice (age, accent, mood, character...) plus optional audio/image references and controls for speed, pitch, or volume.

Audio
BytePlus
Bytedance
Seed Audio
Text to Speech
preview

Seed Audio 1.0 by ByteDance turns text into expressive, recorded-sounding speech, with the voice defined however you like. Describe it in plain language (age, accent, mood, or character). Clone it from up to three short audio clips, or derive it from a single reference image. Built-in emotional range, natural pauses, and emphasis keep long scripts consistent from the first word to the last. Fine-tune speech rate, pitch, loudness, and sample rate, then synthesize across multiple languages and accents.

More models from Bytedance

Seedance 2.0 Fast
Use it ↗Video
Seedance 2.0
Use it ↗Video
Seedance 2.0 Mini
Use it ↗Video
Seedream 5.0 Lite
Use it ↗Image
Seedance 1.5 Pro
Use it ↗Video
Seedream 4.5
Use it ↗Image
SeedVR2 - Image Upscale
Use it ↗Image
SeedVR2 - Video Upscale
Use it ↗Video
Omni Human 1.5
Use it ↗Video
Dreamina 3.1
Use it ↗Image
Seedream 4.0
Use it ↗Image
Seedance 1 (Pro Fast)
Use it ↗Video

Seed Audio 1.0

More models from Bytedance

Seedance 2.0 Fast

Seedance 2.0

Seedance 2.0 Mini

Seedream 5.0 Lite

Seedance 1.5 Pro

Seedream 4.5

SeedVR2 - Image Upscale

SeedVR2 - Video Upscale

Omni Human 1.5

Dreamina 3.1

Seedream 4.0

Seedance 1 (Pro Fast)