Video

Omni Human 1.5

Omni Human 1.5 by ByteDance animates a single photo into a talking avatar, syncing lip movements, expressions, and natural gestures to your audio.

Omni Human 1.5 is ByteDance's closed-source image-to-video model (Oct 2025), specialized for digital human generation. It creates realistic talking avatars from a single image and audio track, with optional text prompts for scene control. The model prioritizes expressive lip-sync accuracy and character animation coherence over general video generation, making it effective for talking avatars and educational content. It analyzes audio to produce animations synchronized with speech rhythm. While unavailable on Scenario, the API is priced from $0.16 per generation.

More models from Bytedance

Seedream 5.0 Pro
Use it ↗Image
Seed Audio 1.0
Use it ↗Audio
Seedance 2.0 Mini
Use it ↗Video
Seed Audio 1.0 Multilingual
Use it ↗Audio
Seedance 2.0 Fast
Use it ↗Video
Seedance 2.0
Use it ↗Video
Seedream 5.0 Lite
Use it ↗Image
Seedance 1.5 Pro
Use it ↗Video
Seedream 4.5
Use it ↗Image
SeedVR2 - Image Upscale
Use it ↗Image
SeedVR2 - Video Upscale
Use it ↗Video
Dreamina 3.1
Use it ↗Image

Omni Human 1.5

More models from Bytedance

Seedream 5.0 Pro

Seed Audio 1.0

Seedance 2.0 Mini

Seed Audio 1.0 Multilingual

Seedance 2.0 Fast

Seedance 2.0

Seedream 5.0 Lite

Seedance 1.5 Pro

Seedream 4.5

SeedVR2 - Image Upscale

SeedVR2 - Video Upscale

Dreamina 3.1