OpenAI's premier video model for pro-quality results. Creates cinematic 1080p videos with synced audio from text, image, or audio inputs. From 168 CU.
Models
All Models
Hunyuan Image 3 by Tencent is an 80B parameter text-to-image model featuring a Mixture-of-Experts (MoE) architecture for high-resolution visual synthesis.
Rodin Gen-2 by Deemos: is a 10B parameter model generating high-poly quad meshes from single or multi-view images.
SeedVR2 by SeedVR is a high-quality super-resolution model that scales images to 4K with texture refinement. Pricing from 5 credits.
SeedVR2 Upscale Image by ByteDance is a one-step diffusion upscaler for high-detail restoration up to 16MP. From 5 credits.
Omni Human 1.5 by ByteDance creates realistic talking avatars from a single image and audio with film-grade lip-sync. Pricing from $0.16/generation.
Wan 2.5 I2V is an open-source model by Alibaba, animating images into 1080p videos with synchronized audio in one pass. Generation from 55 CU.
Short Description: Wan 2.5 T2V is an open-source model by Alibaba, creating 1080p videos with synchronized audio in a single pass. Generation from 55 CU.
Kling 2.5 T2V (Pro) by Kuaishou is a text-to-video model for 1080p output, engineered for faster generation speeds. Pricing from 55 CU.
Kling 2.5 I2V (Pro) by Kuaishou is an image-to-video model for 1080p output, engineered for faster generation speeds. Pricing from 55 CU.
Wan 2.2 Animate (Replace) is a mode of an open-source model by Alibaba that swaps a person in a video with a character image.
Wan 2.2 Animate (Move) is a mode of an open-source model by Alibaba that animates a static character using motion from a video.
Wan 2.2 Reframe is an open-source AI video editor by Alibaba that intelligently changes video aspect ratios with content-aware framing. Unavailable.
Lucy Edit (Pro) by Decart provides stable, production-ready video transformations including wardrobe and scene replacements via text.
Lucy Edit (Dev) by Decart is a developmental editing model optimized for rapid experimentation and lightweight prototyping.
Wan 2.2 Outpainting is an open-source AI video editor by Alibaba that expands video frames by generating new, context-aware content.
HeyGen Video Translate by HeyGen localizes video into 70+ languages with cloned voices and pixel-level lip-sync. Currently unavailable.
Sync Lipsync 2 is a video-to-video model that syncs mouth movements in a video to a provided audio track, producing natural-looking speech alignment, up to 4K resolution.
Luma Modify Video by Luma AI transforms existing footage with environment and texture swapping from around 0 credits.
ElevenLabs 3 (Alpha) generates highly expressive, emotional speech in over 70 languages with non-real-time generation, starting from 15 CUs.
A specialized model to synchronize an audio track with a speaker's mouth movements in a video, creating realistic, high-quality dialogue.
Meta MusicGen generates music from text and can be guided by a reference melody, creating up to 30-second clips. Pricing starts from 15 CUs.
Creatify Lipsync by Creatify produces realistic mouth-movement animations for social-first marketing videos.
ElevenLabs Multilingual 2 produces lifelike, emotionally rich speech in 29 languages with higher latency generation, starting from 15 CUs.
ElevenLabs Turbo 2.5 offers low-latency text-to-speech in 32 languages with fast, cost-effective generation, starting from 8 CUs.
ElevenLabs Sound Effects 2 generates custom sound effects (SFX) from text with seamless looping, creating up to 30-second clips. Pricing starts from 2 CUs.
Sync Lipsync 2 by Sync Labs syncs mouth movements in a video to a provided audio track, producing natural-looking speech alignment. 38 CU
Kling Lipsync by Kuaishou applies accurate lip-sync to an existing video using a separate audio track, focusing solely on facial animation.
Google Lyria 2 generates high-fidelity instrumental music from text, creating up to 30-second 48kHz clips. Pricing starts from 12 CUs.
A high-resolution model with Text-to-Video (T2V) support and First/Last Frame workflows for precise cinematic control and multi-subject handling.
Kling 2.1 (Pro) by Kuaishou is an I2V model with enhanced sharpness, lighting, and both first- and last-frame conditioning for precise 1080p video transitions.
This Flux LoRA creates stylized 3D fantasy characters. The style blends digital illustration with sculpted forms, featuring detailed armor and dragon companions.
This Flux LoRA generates 3D cartoon monsters for RPGs. It features stylized creatures with glowing eyes and armor on simple bases using bold, saturated colors.
This Flux LoRA generates fantasy backdrops like ruins and arenas. It features a painterly style with dramatic lighting, soft gradients, and vibrant colors.
Scenario Flux Upscale increases image resolution while letting you control whether to keep the original look or add more creative detail.
Topaz Image Upscale by Topaz Labs intelligently enhances image resolution while preserving natural textures. Pricing from 15 credits.
This Flux LoRA produces contemporary animated film aesthetics. It features cinematic lighting and smooth forms to create expressive characters and scenes.
This Flux LoRA produces dynamically posed characters with a toy-like aesthetic. It creates bold, collectible-style visuals resembling classic action figures.
This Flux LoRA generates game UI elements and mockups. It produces buttons, HUDs, and menus in styles ranging from sleek sci-fi to ornate fantasy layouts.
This Flux LoRA generates retro RPG environments from a classic top-down perspective. It creates tiled worlds, villages, and dungeons with a vintage game feel.
This Flux LoRA replicates the vintage Franco-Belgian comic style. It produces clean lines and expressive characters found in classic European bandes dessinees.
This Flux LoRA produces 3D cartoon vehicles with retro silhouettes. It features matte textures, pastel tones, and playful details like oversized wheels.
This Flux LoRA generates vibrant game UI icons in a casual cartoon style. It produces buttons and badges with bold colors, glowing effects, and shading.
This Flux LoRA generates fantasy UI frames for mobile games. It produces ornate, magical, and metallic borders for menus and dialogue boxes in RPG styles.
This Flux LoRA generates reward boxes and crates for game UI. It produces fantasy chests and sci-fi loot containers in styles like steampunk or cyberpunk.
This Flux LoRA generates clean, stylized 3D objects for puzzle games. It produces isolated items or grids, featuring simplified details and dark backgrounds.