Kling O1 Reference Images by Kuaishou generates video using multiple images to define a character or object, ensuring consistency. Pricing from 90 CU.
Models
All Models
Creatify Aurora by Creatify is an image-to-video model for ultra-realistic reactive avatars with emotive gestures. Currently unavailable.
Sync Lipsync React-1 by Sync Labs provides audio-driven lip-sync with integrated emotion presets and head movement control.
FLUX 1.1 (Pro) by Black Forest Labs has enhanced performance with improved speed and detail over the original Flux 1 series. Pricing from 6 CU.
FLUX 1.1 (Pro Ultra) by Black Forest Labs does 4MP resolution with a specialized "Raw" mode for candid, authentic photography styles. Pricing from 9 CU.
Kling AI Avatar 2 (Pro) by Kuaishou generates a lifelike talking video from a single static image and an audio track, focusing on precise lip-sync.
Kling 2.6 I2V (Pro) by Kuaishou animates images into 1080p video with synchronized native audio and advanced motion. Pricing from 105 CU.
Kling 2.6 T2V (Pro) by Kuaishou generates 1080p video with synchronized native audio directly from text prompts. Pricing from 105 CU.
Kling O1 I2V by Kuaishou creates 1080p video from images with precise start/end frame control for structured animation. Pricing from 90 CU.
This Scenario model creates voxel-based 3D models from prompts with simple controls for size and scale.
Grid Maker by Scenario organizes multiple images into a clean, adjustable grid layout for a unified and clear visual overview.
Scenario Video to Image Sequence converts video footage into high-quality, frame-by-frame image sequences with customizable intervals for seamless asset generation.
Sequence-to-Video by Scenario turns a collection of images into a video file with settings to control the playback speed, looping style, and sound.
Scenario Gemini Upscale by Google uses multimodal reasoning to sharpen images while balancing structure and creativity from 36 credits.
This Flux LoRA generates sci-fi landscapes using bold lines and cartoon aesthetics. It produces any landscape or environment with smooth shading.
This LoRA generates glossy, paint-like surreal scenes with dripping liquids, swirling textures, vibrant florals, and candy-colored spheres in dynamic abstract spaces.
This Flux LoRA generates stylized 3D cartoon characters like robots and wizards. It uses vibrant colors and exaggerated proportions for games and animation.
This Flux LoRA produces semi-realistic costume collections. It generates any historical or professional gear from helmets to boots in muted, natural tones.
Z-Image Turbo (Dec 2025) is a distilled 1-step model for nearly instant, real-time image generation with higher resolution than older LCMs. Fast and affordable
LTX-2 Retake by Lightricks enables precision re-rendering of video segments to fix performance or dialogue without full regeneration.
Image Slicer by Scenario cuts an image into a custom grid of smaller sections and saves each one as its own separate file.
Pixel Snapper cleans up pixel art by fixing scaling issues and keeping the blocks aligned for a sharp, retro look.
A specialized model built for generating seamless, top-down tilemaps with perfect grid alignment and terrain coherence.
A premium pixel-art generation model designed for exceptional quality, rich detail, and stylistic accuracy.
A dedicated animation model featuring specialized motion-aware generation, perfect for sprite sequences.
FLUX 2 (Dev) by Black Forest Labs is an open-weight model balanced for experimentation and custom training workflows. Pricing from 3 CU.
Meshy Rigging generates skeletal structures and skin weights for 3D meshes to enable character animation.
Hunyuan 3D Part is a 3D-to-3D segmentation model that breaks existing meshes into organized, editable sub-parts.
Gemini 3.0 Pro (Edit) by Google offers professional-grade conversational editing with deep context awareness. Gemini 3.0 Pro supports 1K, 2K and 4K output
Flash VSR Upscale Video by Tsinghua/Shanghai AI Lab provides real-time diffusion-based upscaling with a one-step streaming framework.
A video-to-video tool to replace subjects or backgrounds with a reference image. Features Person and Background modes for precise control.
Hunyuan 3D Pro 3.0 Sketch transforms hand-drawn line art and sketches into textured 3D meshes from 105 credits.
Hunyuan 3D 3.0 (Pro) by Tencent is a state-of-the-art 10B parameter image-to-3D model with 1536³ resolution and hierarchical DiT carving.
Hunyuan 3D Pro 3.0 Multiview reconciles up to 4 reference images to produce symmetric 3D assets from 120 credits.
Qwen Edit Multi-Angle by Alibaba provides camera-aware image editing for consistent perspective shifts. Generation in about 8s, pricing from 5 CU.
Gemini 2.5 Edit (aka NanoBanana) by Google enables rapid, text-based photo modifications and background adjustments from 7 credits.
Unified text-guided image editing with high character and style preservation. Pricing from 11 CU.
OpenAI's GPT Image 1 model for image editing and generation
OpenAI's top model for quality image editing
Seedream 4.0 by ByteDance generates and edits 4K images with unified architecture and complex reasoning. Pricing from 5 CU.
Seedream 4.5 by ByteDance edits images with natural language instructions while preserving reference details. Generation around 6 seconds, pricing from 6 CU.
P-Image Edit by Pruna AI is a high-speed image editing model that applies precise transformations, using up to 10 reference images. From 2 credits.
Meshy Remesh converts existing 3D models into cleaner, quad-based geometry to optimize performance and topology.
Meshy Image-to-3D generates a textured 3D mesh from one or more images, creating geometry for unseen sides.
Meshy Text-to-3D (v4, V5 and v6) creates textured 3D assets from prompts, with various control options.