Meshy Retexture applies new PBR texture maps to existing 3D models based on text-guided instructions or style images.
Models
All Models
Flux.2 [klein] 4B is a fast, lightweight model built for real-time image generation.
FLUX.2 [klein] 4B Base is a flexible base model designed for control and customization.
FLUX.2 [turbo] from Black Forest Labs (Nov 2025) is a dual image generation and editing model, designed for speed and cost-efficiency
FLUX.2 [klein] 9B Base is a high-capacity model focused on maximum detail and prompt understanding.
FLUX 2 (Max) Edit is the highest-fidelity editing model for maximum consistency and professional retouching. Pricing from 11 CU.
FLUX 2 (Flex) Edit: Precision editing focused on typography, small details, and complex layout changes. Pricing from 18 CU.
FLUX 2 (Pro) Edit is a fast, reliable default for practical editing tasks like object removal and background cleanup. Pricing from 7 CU.
LongCat Image by Meituan provides natural-language-driven image editing with high semantic awareness.
Sparc3D Portrait by Hitem3D is a High-fidelity 1536³ voxel reconstruction optimized for human facial anatomy and expressions.
MiniMax Hailuo 2.3 (Fast) by MiniMax generates high-motion video with optimized latency. Pricing from 29 credits per generation.
MiniMax Hailuo 2.3 by MiniMax generates cinematic 1080p video with advanced motion consistency. Pricing from 43 credits per generation.
BiRefNet v2 provides high-resolution foreground extraction for complex objects like hair and transparent edges.
Seedance 1 (Pro Fast) by ByteDance generates 1080p cinematic video optimized for speed and cost efficiency. Pricing from 45 CU.
Abandoned Structures - Kontext transforms clean building images into worn, decayed versions while keeping the original structure and layout the same.
LTX-2 Fast by Lightricks generates high-fidelity 4K video previews in seconds for rapid brainstorming. Pricing from 32 credits.
-2 Pro by Lightricks delivers 4K-capable video with audio for professional reviews and pitches. Pricing from 47 credits.
Kling 2.5 I2V (Standard) by Kuaishou is a cost-effective image-to-video model for 720p output, engineered for speed. Pricing from 35 CU.
Facial Expression Sheet - Kontext turns a character image into a grid of nine different emotions while keeping their appearance and the art style the same.
REVE Remix by Halfmoon AI is a context-aware image merging and object-level manipulation using text prompts. Pricing from 6 credits.
Crystal Upscaler by Clarity AI specializes in high-precision facial and portrait enhancement. Pricing from 10 credits.
Flux Kontext LoRA turns basic 3D blockouts into detailed scenes or objects while making sure the original shapes and layout stay the same.
Google's Veo 3.1 (Fast) is a high-speed variant of Veo 3.1, offering its advanced features and multiple input types for rapid, cost-effective prototyping.
Flux Kontext LoRA creates character turnaround sheets with four different views to show a design from every side on a clean background.
Isometric Tile Maker - Kontext turns photos of buildings into detailed, small-scale 3D models set on square tiles.
Hunyuan Image 3 by Tencent is an 80B parameter text-to-image model featuring a Mixture-of-Experts (MoE) architecture for high-resolution visual synthesis.
Rodin Gen-2 by Deemos: is a 10B parameter model generating high-poly quad meshes from single or multi-view images.
SeedVR2 by SeedVR is a high-quality super-resolution model that scales images to 4K with texture refinement. Pricing from 5 credits.
SeedVR2 Upscale Image by ByteDance is a one-step diffusion upscaler for high-detail restoration up to 16MP. From 5 credits.
Omni Human 1.5 by ByteDance creates realistic talking avatars from a single image and audio with film-grade lip-sync. Pricing from $0.16/generation.
Wan 2.5 I2V is an open-source model by Alibaba, animating images into 1080p videos with synchronized audio in one pass. Generation from 55 CU.
Short Description: Wan 2.5 T2V is an open-source model by Alibaba, creating 1080p videos with synchronized audio in a single pass. Generation from 55 CU.
Kling 2.5 T2V (Pro) by Kuaishou is a text-to-video model for 1080p output, engineered for faster generation speeds. Pricing from 55 CU.
Kling 2.5 I2V (Pro) by Kuaishou is an image-to-video model for 1080p output, engineered for faster generation speeds. Pricing from 55 CU.
Wan 2.2 Animate (Replace) is a mode of an open-source model by Alibaba that swaps a person in a video with a character image.
Wan 2.2 Animate (Move) is a mode of an open-source model by Alibaba that animates a static character using motion from a video.
Wan 2.2 Reframe is an open-source AI video editor by Alibaba that intelligently changes video aspect ratios with content-aware framing. Unavailable.
Lucy Edit (Pro) by Decart provides stable, production-ready video transformations including wardrobe and scene replacements via text.
Lucy Edit (Dev) by Decart is a developmental editing model optimized for rapid experimentation and lightweight prototyping.
Wan 2.2 Outpainting is an open-source AI video editor by Alibaba that expands video frames by generating new, context-aware content.
Sync Lipsync 2 is a video-to-video model that syncs mouth movements in a video to a provided audio track, producing natural-looking speech alignment, up to 4K resolution.
Luma Modify Video by Luma AI transforms existing footage with environment and texture swapping from around 0 credits.
ElevenLabs 3 (Alpha) generates highly expressive, emotional speech in over 70 languages with non-real-time generation, starting from 15 CUs.
A specialized model to synchronize an audio track with a speaker's mouth movements in a video, creating realistic, high-quality dialogue.
Meta MusicGen generates music from text and can be guided by a reference melody, creating up to 30-second clips. Pricing starts from 15 CUs.
Creatify Lipsync by Creatify produces realistic mouth-movement animations for social-first marketing videos.