Vidu Q1 image-to-video generation. 5 seconds, 1080p only.
Models
All Models
Vidu Q1 text-to-video generation. 5 seconds, 1080p only.
Vidu 2.0 references-to-video generation. 4 seconds, 360p/720p.
Vidu 2.0 image-to-video generation. 4s (360p/720p/1080p) or 8s (720p only).
A versatile, high-speed engine designed for efficient 3K generation and rapid creative iteration.
Fast tier (8 steps) for rapid iteration with synchronized audio.
Extend videos by generating continuation frames. Input a video and describe how it should continue.
Next-generation cinematic video model offering high-fidelity motion and extensive aspect ratio flexibility.
Generates native, fully editable vector graphics (SVG) directly from text prompts with clean geometry.
An aesthetic transformation tool that reapplies global artistic filters to existing 3D models.
Generate detailed SVG vector graphics from text prompts. Recraft V4 Pro's design taste with more geometric detail and finer paths — clean layers, editable output, and scalable to any size.
Premium high-fidelity model delivering super-resolution and superior anatomical precision for production-ready assets.
An automated skeletal detection system that prepares 3D meshes for movement by applying bones, skin weights, and preset animations.
The standard design-centric model for high-quality raster image generation with professional artistic flair.
An optimization tool designed to convert complex 3D meshes into clean, performance-friendly geometries.
A specialized model for generating high-fidelity PBR textures and materials for 3D models using text, image, or style references.
Reconstructs precise 3D assets using four-axis image inputs for superior dimensional accuracy and fewer visual gaps.
Generate and edit videos using xAI's Grok model — supports text-to-video, image-to-video (up to 15s), and video editing (up to 8.7s) with 480p/720p resolution and flexible aspect ratios
Generate and edit images using xAI's Grok model, powered by Aurora — supports text-to-image generation and image editing with multiple aspect ratios and batch output up to 10 images
Generate and edit high-resolution images using xAI's Grok Pro model, powered by Aurora — supports text-to-image generation and image editing with up to 2K resolution, multiple aspect ratios, and batch output up to 10 images
Darken edges of video to create a vignette effect.
Apply color tint overlays to video.
Create solarization effect by inverting colors above threshold.
Enhance video sharpness and detail.
Reduce color depth for a poster art effect.
Apply parabolic distortion effect to video.
Transform video with oil painting artistic effect.
Apply cinematic color grading with 3D LUT presets.
Add film grain texture with various film stock profiles.
Add glow and bloom lighting effects to video.
Apply dodge and burn photographic techniques.
Blend video with an image using dissolve transition.
Remove or reduce color saturation in video.
Transform video with abstract cubist art style.
Create crystallized superpixel mosaic effect.
Adjust color, brightness, contrast, and exposure.
Create chromatic aberration by shifting color channels.
Apply blur effects to video.
Specialized technical model for restoring and correcting product labels, logos, and intricate packaging branding.
Elite flagship model with an integrated reasoning engine for top-tier prompt fidelity and flawless typography.
High-speed, cost-efficient model optimized for rapid concept exploration and high-volume ideation.
AI text generation and image analysis. Generate one or multiple results from instruction, text inputs and images.
Compose multiple images, videos, and audio into a single video with layers, transforms, effects, transitions, and blending modes.
Compose multiple images into a single image with layers, transforms, effects, and blending modes.
Kling 3.0 Standard: Top-tier text-to-video with cinematic visuals, fluid motion, and native audio generation, with multi-shot support.
Kling 3.0 Standard: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation, with custom element support.
Kling 3.0 Pro: Top-tier text-to-video with cinematic visuals, fluid motion, and native audio generation, with multi-shot support.