ElevenLabs Multilingual 2 produces lifelike, emotionally rich speech in 29 languages with higher latency generation, starting from 15 CUs.
Models
All Models
ElevenLabs Turbo 2.5 offers low-latency text-to-speech in 32 languages with fast, cost-effective generation, starting from 8 CUs.
ElevenLabs Sound Effects 2 generates custom sound effects (SFX) from text with seamless looping, creating up to 30-second clips. Pricing starts from 2 CUs.
Sync Lipsync 2 by Sync Labs syncs mouth movements in a video to a provided audio track, producing natural-looking speech alignment. 38 CU
Kling Lipsync by Kuaishou applies accurate lip-sync to an existing video using a separate audio track, focusing solely on facial animation.
Google Lyria 2 generates high-fidelity instrumental music from text, creating up to 30-second 48kHz clips. Pricing starts from 12 CUs.
A high-resolution model with Text-to-Video (T2V) support and First/Last Frame workflows for precise cinematic control and multi-subject handling.
Kling 2.1 (Pro) by Kuaishou is an I2V model with enhanced sharpness, lighting, and both first- and last-frame conditioning for precise 1080p video transitions.
This Flux LoRA creates stylized 3D fantasy characters. The style blends digital illustration with sculpted forms, featuring detailed armor and dragon companions.
This Flux LoRA generates 3D cartoon monsters for RPGs. It features stylized creatures with glowing eyes and armor on simple bases using bold, saturated colors.
This Flux LoRA generates fantasy backdrops like ruins and arenas. It features a painterly style with dramatic lighting, soft gradients, and vibrant colors.
Scenario Flux Upscale increases image resolution while letting you control whether to keep the original look or add more creative detail.
Topaz Image Upscale by Topaz Labs intelligently enhances image resolution while preserving natural textures. Pricing from 15 credits.
This Flux LoRA produces contemporary animated film aesthetics. It features cinematic lighting and smooth forms to create expressive characters and scenes.
This Flux LoRA produces dynamically posed characters with a toy-like aesthetic. It creates bold, collectible-style visuals resembling classic action figures.
This Flux LoRA generates game UI elements and mockups. It produces buttons, HUDs, and menus in styles ranging from sleek sci-fi to ornate fantasy layouts.
This Flux LoRA generates retro RPG environments from a classic top-down perspective. It creates tiled worlds, villages, and dungeons with a vintage game feel.
This Flux LoRA replicates the vintage Franco-Belgian comic style. It produces clean lines and expressive characters found in classic European bandes dessinees.
This Flux LoRA produces 3D cartoon vehicles with retro silhouettes. It features matte textures, pastel tones, and playful details like oversized wheels.
This Flux LoRA generates vibrant game UI icons in a casual cartoon style. It produces buttons and badges with bold colors, glowing effects, and shading.
This Flux LoRA generates fantasy UI frames for mobile games. It produces ornate, magical, and metallic borders for menus and dialogue boxes in RPG styles.
This Flux LoRA generates reward boxes and crates for game UI. It produces fantasy chests and sci-fi loot containers in styles like steampunk or cyberpunk.
This Flux LoRA generates clean, stylized 3D objects for puzzle games. It produces isolated items or grids, featuring simplified details and dark backgrounds.
This Flux LoRA generates scenes with two distinct characters: a red-haired heroine and a green zombie. It maintains separate features to avoid visual bleeding.
This Flux LoRA generates a retro hero boy in neon outfits. The style reflects 80s arcade gaming with digital helmets, glowing visors, and futuristic elements.
This Flux LoRA generates modular RPG assets and strategy game props. It produces stylized stone arches, market stalls, and greenery for building environments.
This Flux LoRA generates vibrant, cartoonish battlegrounds for RPGs. It produces fantasy environments like volcanic pits and forests with bold, clean details.
This Flux LoRA produces fantasy creatures in a modern animated style. It features smooth brushstrokes, soft shading, and dynamic lighting on black backgrounds.
This Flux LoRA produces glossy, 3D icons with rounded shapes. It creates vibrant UI elements and characters with a polished, soft style and bold proportions.
This Flux LoRA generates whimsical architectural structures in isometric views. It produces detailed fantasy buildings with clean lines and bright colors.
This Flux LoRA produces clean, modern 3D assets with smooth surfaces and rounded edges. It creates minimalist props and vehicles for mobile apps or games.
This Flux LoRA creates anthropomorphized objects with expressive faces. It applies a vibrant cartoon style to furniture, food, and tools using wide eyes.
Luma Video Reframe (May 2025) adjusts the aspect ratios of a video by outpainting beyond the original frame while maintaining visual coherence and scene logic
Dreamina 3.1 by ByteDance generates images with enhanced aesthetics and diverse artistic styles. Pricing from 10 CU.
deogram Character by Ideogram is a specialized model for maintaining character consistency from a single reference image.
Qwen Image by Alibaba is a foundational text-to-image model with strong text rendering and artistic style support. Generation from 4 CU.
A model to upscale skyboxes while preserving the seamless aspect
A model to upscale textures & materials while preserving their seamless aspect
Flux-based upscaling model by Scenario
Aleph by Runway is a context-aware video editing model capable of scene reconstruction and novel camera angle generation.
Wan 2.2 T2V is an open-source text-to-video model by Alibaba with a Mixture-of-Experts architecture and Last Frame conditioning. From 15 CU.
FLUX 1 Krea (Dev): An "opinionated" model emphasizing natural textures and diverse aesthetics to avoid the "AI look." Pricing from 4 CU.
Wan 2.2 I2V is an open-source image-to-video model by Alibaba with a Mixture-of-Experts architecture and Last Frame conditioning. From 35 CU.
Runway Upscale V1 by Runway enhances generated video resolution to 4K while maintaining temporal consistency across frames.
Runway Gen4 Turbo by Runway is an early 2025 high-speed image-to-video generation model optimized for low-latency iteration. First frame is required. 36 CU
PartCrafter by PKU: A compositional DiT model generating up to 16 semantically distinct, separate 3D meshes from a single RGB image.
MiniMax Video 02 (Hailuo 02) by MiniMax produces realistic video with natural human movement. Pricing from 72 credits per generation.
MiniMax Image-01 (Apr 2025) specializes in complex lighting effects like sub-surface scattering for highly realistic skin.
Recraft v3 SVG by Recraft generates scalable vector graphics for logos and icons. Pricing from 9 credits.