Introducing Gemini 3.1 🍌: High-quality text-to-image and image-to-image generation
Create on-brand visuals at speed with Gemini 3.
What is Gemini 3.1 🍌?
Create on-brand visuals at speed with Gemini 3.1 🍌, Google's latest image generation model. Whether you're prototyping new concepts or producing campaign assets at scale, Gemini 3.1 🍌 fits into your creative workflow and turns text prompts into polished imagery fast — no back-and-forth, no bottlenecks.
Generation is just the starting point. Gemini 3.1 🍌 supports prompt-based editing, so you can refine existing images with simple text commands — adjust lighting, shift styles, update elements — all without leaving the platform. Visuals are grounded by Google Search for added realism and contextual accuracy, and every image includes an invisible digital watermark for built-in transparency.
Key Capabilities
- Generate high-quality images from text or existing images
- Edit and refine visuals using simple text prompts
- Produce content across various aspect ratios and resolutions
- Ground visuals in Google Search for enhanced realism and accuracy
- Built for speed and high-volume creative production
Examples
Here's what Gemini 3.1 🍌 can do across a range of styles and compositions.

A stylish young woman posing confidently indoors with dramatic red lighting filling the room. She is dressed in a coordinated red sweater with a large bow detail and a pleated skirt, paired with matching knee-high socks. Her stance is strong and self-assured, with one hand placed on her hip while she looks directly at the camera. The red LED glow in the background enhances the bold and eye-catching aesthetic of the scene.
{
"subject": "Close-up side profile of a joyful young tiger anime character with a massive, toothy grin and closed eyes.",
"art_style": "Abstract expressive anime portrait, modern graphic illustration, dynamic digital painting, energetic pop art.",
"color_palette": {
"overall_tone": "Ultra-vibrant, highly saturated, high contrast.",
"dominant_colors":["Bright golden yellow", "Vivid cyan/light blue", "Crimson red", "Warm peach", "Deep black"],
"shading_colors":["Deep pink", "Muted violet", "Dark red"]
},
"technique_and_medium": {
"primary_medium": "Digital illustration mimicking a blend of marker sketches and thick acrylic paint.",
"brushwork": "Sharp, geometric color blocking mixed with chaotic, loose strokes.",
"shading": "Fractured cel-shading with hard edges, entirely lacking smooth gradients."
},
"linework": {
"style": "Erratic, sketchy, thin black ink lines.",
"characteristics": "Rough cross-hatching, overlapping strokes, lines that do not strictly enclose shapes but rather suggest form and movement, raw and unfinished aesthetic."
},
"visual_effects": {
"dispersion": "Shatter/splinter effect, sharp geometric polygons and shards of solid color breaking away from the subject.",
"movement": "Dynamic directional flow, diagonal color streaks and flying paint splatters suggesting high wind or explosive energy.",
"edges": "Jagged, fragmented silhouettes blending into the background."
},
"lighting": "Harsh, stark directional lighting that creates distinct blocks of highlights and deep, hatched shadows.",
"background": "Pure, stark white (#FFFFFF) to act as negative space, emphasizing the vibrant explosion of colors from the subject.",
"mood_and_atmosphere": "Euphoric, intensely energetic, chaotic, triumphant, free-spirited, raw emotion."
}
A sweeping aerial view of a tranquil Pacific lagoon with crystal-clear turquoise waters and lush tropical foliage along the shore.
{
"subject_character": {
"identity": {
"demographics": "teenage girl, 17, rebellious appearance",
"hair": "dyed black hair with fading roots, messy chin-length",
"face": "smudged eyeliner from earlier in the day, sullen pout",
"body": "slim, petite, olive skin tone, manicured fingers"
},
"action_and_pose": {
"body_language": "leaning back, legs crossed and resting on the desk",
"gesture": "fiddling with a ballpoint pen",
"expression": "bored, defiant, staring at the desk"
},
"attire": {
"outfit": "oversized school blazer with lint and chalk dust on the lapel",
"accessories": "safety pin through the blazer lapel"
}
},
"scene_atmosphere": {
"location": {
"setting": "empty high school classroom",
"details": "dust motes dancing in sunbeams, high windows, stacked chairs in background"
},
"lighting": {
"source": "morning sun through high windows",
"direction": "slanted side lighting",
"quality": "dappled, soft shadows, volumetric dust"
},
"ambience": {
"season": "late summer",
"weather": "warm morning"
}
},
"artistic_direction": {
"photography_gear": {
"lens": "85mm prime",
"aperture": "f/1.4",
"focus": "shallow depth, sharp on eyeliner and hair texture"
},
"aesthetic_style": {
"film_stock": "Medium Format Digital",
"rendering": "matte finish, rich textures",
"color_grade": "warm nostalgic amber",
"mood": "lofi indie movie vibes"
},
"composition": {
"framing": "full body shot",
"angle": "low angle",
"visual_devices": "shallow bokeh blurring the empty classroom"
}
}
}
Show me the character in four side by side variants (e.g. outfit, pose, details). Following these instructions: Very different poses and experessions. On one single uniform grey background, seamless, no split.

Make a full-body turnaround sheet of this exact robot. Three full-body poses on a pure white background — front view, left profile, and back view — evenly spaced in a horizontal row and in a consistent style.
A cute chibi anime robot. Uniform background saying like in a comit "Nano Banana 2 is on Scenario"


Merge the two provided images into a single, unified scene following the specified instruction: Add the alien to the soccer field scene, looking up a the guy as the guy looks down to him, no coffee gup, while ensuring each subject's face, identity, proportions, and outfit are consistently preserved. Ensure natural integration with coherent lighting, perspective, and scale so both characters appear realistically together. Generate a background consistent with the input images' style, or adapt it according to the instruction. Adjust body position, angle, or expression where necessary so the characters feel naturally posed together instead of copied side by side, adapting to the context of the instruction. Create interaction through body language, gestures, shared props, or eye contact, making them visibly connected within the scene, per the instruction. Apply proper shadows for physical accuracy, producing a polished composite suitable for group photos, character interactions, or cinematic posters.
A stylized illustration of a girl with short black hair and bright green eyes, wearing a sleek black and blue bodysuit, stands confidently in chunky platform sneakers. She is surrounded by dynamic abstract shapes, smoke-like swirls, and geometric symbols on an orange background, giving the image a futuristic and energetic vibe.
Portrait of a young cyborg woman, one half human with warm skin tones, the other revealing golden circuits and a glowing blue cybernetic eye, dark laboratory background with blurred holograms, dramatic sci-fi realism.
Generate a technical concept sheet showing only the individual armor pieces of the main hero.
Display the armor components separately and clearly, as if on a design or blueprint sheet.
Use neutral lighting and a clean background, focusing on shape, structure, materials, and functional details.
No character pose or personality, only the armor parts presented for clear readability and design reference.

Generate an 8-direction turnaround of the character in the uploaded image. Use the uploaded character as the true front reference. Keep a neutral idle pose with the same arm positions, legs, props and facial expression in all views. Only rotate the character around its vertical axis.
Create exactly these eight views, each one a separate full-body sprite:
Bottom center: true front view, facing directly toward the viewer, both eyes visible.
Bottom right: front-right 3/4 view (about 45 degrees).
Right: exact right side view (90 degrees).
Top right: back-right 3/4 view (about 135 degrees).
Top center: true back view, facing directly away from the viewer.
Top left: back-left 3/4 view (about 225 degrees).
Left: exact left side view (270 degrees).
Bottom left: front-left 3/4 view (about 315 degrees).
Keep the character’s proportions, colors, costume details, equipment and silhouette perfectly consistent across all eight views. Do not change the pose, gesture, or weapon position between views; only the viewing angle should change. Arrange the eight sprites evenly in a circular layout on a plain flat background.

Create a full-page technical illustration of the robot, based on the reference image.
Depict the machine using a richly rendered painted style, with a cutaway/exploded layout that exposes all internal structures: armor layers, frame supports, actuators, servo assemblies, hydraulic or mechanical systems, power modules, cooling channels, sensor arrays, weapon components, and routing of cables and conduits.
Include numerous fine callout lines with small technical labels identifying key parts across the illustration.
Surround the main visual with short text blocks describing the robot’s purpose, design philosophy, engineering challenges, and unique capabilities.
Design the layout like a high-end technical encyclopedia or machinery analysis book: title header, subheads, detailed captions, margins with subtle visual markers, and a clean “Data File” panel containing specifications (manufacturer, model, height, weight, power source, armament, role/class, and notable features).
Keep the lighting dramatic but clear, with realistic metallic textures, environmental reflections, and internal glow from components that require energy or heat dissipation.
Avoid blueprint aesthetics; prioritize a painted, semi-realistic technical cutaway illustration that feels like a premium machinery reference spread.

Generate a finished character asset image using the provided character as reference.
The final output must match exactly the same layout, proportions, thumbnail formats and visual hierarchy as the reference image.
Main layout (ABSOLUTE)
• Canvas: horizontal
• Three thumbnails aligned horizontally, side by side
• Even spacing between thumbnails
• Centered vertically on a neutral light background
• No full body character in this image
The image must match the reference at first glance:
three rectangular thumbnails in a horizontal row, each with a different format and size.
Thumbnail order (LEFT → RIGHT)
LEFT – Upgrade state
• Rectangular horizontal small format (wider than tall)
• Bust shot of the character
• Green upward arrow overlay
• Subtle green glow effect
• Arrow must not cover the face
• Light gray background
• Thin gray border
MIDDLE – Locked state
• Rectangular vertical format (taller than wide)
• White silhouette of the character (close-up)
• No internal details
• Flat graphic look
• Light gray background
• Thin gray border
RIGHT – Default portrait
• Rectangular horizontal large format
• Larger than the left thumbnail
• Bust shot with more context (face and weapon/hand visible)
• No arrow
• No glow
• Clean presentation
• Light gray background
• Thin gray border
Consistency rules (STRICT)
• All thumbnails represent the same character
• Thumbnails LEFT and RIGHT use the same camera angle, with different crop scale
• Silhouette matches the character proportions
• Do not redesign, stylize or reinterpret the character
Style
• Stylized 3D cartoon
• Clean materials
• Soft lighting
• Game UI focused
Final constraints
• Same overall canvas size as reference
• Same spacing and alignment
• Same visual hierarchy
• Do not standardize thumbnail sizes
• No layout experimentation

Show me the character in four side by side variants (e.g. outfit, pose, details). Following these instructions: Very different poses and experessions. On one single uniform grey background, seamless, no split.
Create a 3x3 grid of 9 unique trading cards using each uploaded image. Each card should feature one of the uploaded images as the character or theme. Design all cards in the specified trading card style with authentic layouts, stats to the top and bottom, attributes, type designations, and descriptions.
Each card must be unique with different stats and abilities.
Arrange the 9 cards in a clean 3x3 grid with equal spacing. Use white background outside the grid.
Output in 3:4 vertical format for pack display.

Create a full character turnaround sheet featuring the front, side, and back views of this character, standing, each labeled accordingly beneath the pose. Ensure the style, proportions, and details remain consistent across all angles. Display the turnaround on a clean, neutral grey background. Remove weapons or anything the character is holding.

Transform the provided images into a vibrant game UI overlay with bold, colorful buttons and a prominent game title placed around the character(s).
Add clear interface elements that fit a modern game theme, such as ability icons or menu tabs, while ensuring the design is readable and visually balanced.

Make a full character concept sheet for this character including labeled views or poses, some key facial expressions. Add detailed callouts for important features, materials, patterns, accessories, or gear. Include a clean title header for the character's name, plus small text notes or annotations describing design choices, materials, or movement. Incorporate a color palette swatch panel and 2-3 optional, diverse silhouette thumbnails. Additional instructions: Create a polished character reference sheet. Include: full-body front view, full-body back view, 3 face expressions, 3 close-ups of important design details (hair, outfit, accessories), and a clean color palette section.

Create an eye-catching ad creative for wireless headphones.
Use neon or vibrant color accents, dynamic lighting, and energetic motion effects around the product.
Add a clear tagline like “Unlock Your Sound,” and a short benefit statement (comfort, transparency mode, deep bass).
Place a subtle call-to-action label such as “Listen Now” or “Shop Today.”
Ensure the final design is aesthetic, modern, and optimized for square or vertical social formats.


Using the uploaded human character strictly as a class, role, and fantasy archetype reference, and the uploaded animal as the primary anatomical, physical, and expressive foundation, create an original stylized 3D cartoon humanoid creature.
The character’s class behavior, equipment, and abilities must be determined exclusively by the human reference.
• If the human reference is a mage, the final character must clearly be a mage (no firearms, no modern weapons, no soldier gear).
• If the human reference is a soldier, then firearms and tactical gear are allowed.
Do not reinterpret or replace the class. The class identity is mandatory and non-negotiable.
The final design should be a humanoid animal creature, not a human in costume. The anatomy does NOT need to be perfectly human:
• Legs may remain digitigrade or animal-like
• Spine, neck, and posture should follow the animal’s natural structure
• Hands, arms, and torso may be partially humanoid only as needed for class actions
Blend the animal’s body mass, proportions, muscle flow, fur or skin texture, head shape, and natural stance with the fantasy class in a bold and exaggerated way. Avoid clean human anatomy with animal head swaps.
The pose must be extremely exaggerated, expressive, and theatrical, inspired by the animal’s instinctive behavior combined with the class fantasy. Push asymmetry, dramatic curves, strong silhouettes, and dynamic balance.
The facial expression should be over-the-top and charismatic, clearly communicating emotion, intent, and personality at a glance. Emphasize stylized cartoon acting: intense focus, feral intelligence, mystical confidence, controlled rage, or predatory calm, depending on the animal and class combination.
Equipment and magical elements must be appropriate to the class and adapted organically to the animal-humanoid anatomy.
• A mage should wield staves, arcane constructs, magical manifestations, or ritual objects
• Magic may emerge from claws, eyes, mouth, spine, or tail if appropriate
Render the character in a polished stylized 3D cartoon style with strong shape language, exaggerated proportions, and high personality appeal, suitable for a contemporary fantasy or hero-style game.
Place the character in an environment that reinforces the fantasy class and animal nature, such as arcane ruins, mystical arenas, ancient temples, corrupted landscapes, or magical battlefields. The background should enhance mood and identity without overpowering the character.
Use dramatic, cinematic lighting that emphasizes form, expression, and magical energy. The final image should look like official key art for a high-impact fantasy game character.

Turn the building in the photo into an isometric 3D model, on a square tile. Keep a simple, 3D render style.

Recreate the character strictly based on:
Character Description: A fantasy character of unusual beauty, with ethereal features and a deep gaze that seems to hold ancient secrets. Her exotic appearance combines mystical and elegant elements, creating a stylish and enchanting look. She conveys power and grace at the same time, like someone who belongs to a magical and fascinating world.
Art Style Description: Cute 3D Character Art, featuring highly detailed, soft fur textures and large, expressive eyes on a charming, vibrant creature design, rendered with a polished, clean aesthetic.
The final result must faithfully match the provided reference image. Do not reinterpret, redesign, stylize, or adjust proportions.
All facial features must remain identical to the reference, including eye shape, size, spacing, eyelid thickness, iris proportion, and expression. Do not enlarge, soften, or exaggerate features.
Preserve the exact silhouette, anatomy, hair volume, costume structure, ornament placement, and overall proportions. The style must adapt to the reference image without altering structural fidelity.
This is a direct translation into a final polished render, not a reinterpretation. Prioritize accuracy over stylization.
A Mad Max-style racing car with off-road tires and a very aggressive look. Dynamic camera.
A whimsical woodland village scene features adorable animals such as foxes, squirrels, a bear, an owl, and a rabbit engaging in magical and scholarly activities. There’s a cozy house with bubbling potions outside, a grand tree with a library inside where a bear reads to cubs, a crystal-topped magic tower with swirling energy, an outdoor potion market, and a fortified gate guarded by an armored badger. The village has colorful crystals, ancient ruins, and soft, glowing lights, creating a charming enchanted atmosphere.
A blue, fluffy creature with large eyes sits on the ground, wearing red sneakers and white leg warmers, set against a bright solid yellow background.
{
"meta": {
"style": "8k raw photo, hyper-detailed, photorealistic masterpiece, National Geographic aesthetic",
"creativity_temp": 1.8
},
"subject": {
"identity": "Realistic interpretation of Clash Royale Hog Rider and mount. Rider: muscular dark-skinned male, defined vascularity, signature black mohawk, gold nose ring. Hog: massive boar, pinkish-grey skin, prominent ivory tusks.",
"pose_action": "Mid-gallop across shallow water. Hog's front hooves smashing into the brine, generating a crown-splash of saline droplets. Rider leaning forward, gripping leather reins, golden warhammer raised.",
"material_detail": "Rider Skin: PBR subsurface scattering, visible sweat pores, glistening moisture, hyper-realistic melanin texture. Hog Fur: Coarse, stiff bristles, wet and matted near legs, distinct follicle density. Leather: Worn saddle texture, cracked edges. Metal: Hammer gold with micro-scratches and oxidation."
},
"environment": {
"location": "Salar de Uyuni, Bolivia. Infinite horizon where sky meets earth.",
"background_elements": "Seamless mirror reflection of the azure sky and cumulus clouds on the ground. Hexagonal salt crust patterns visible through translucent shallow water.",
"atmosphere": "High-altitude clarity, thin air, zero haze. Water surface tension breaking at impact points. Crystalline salt particles suspended in splash droplets."
},
"lighting": {
"source_angle": "High-noon zenith sun, hard directional light, 90-degree angle.",
"kelvin_quality": "5800K pure daylight, blinding white albedo from salt reflection.",
"visual_effects": "Ray-traced reflections, harsh contact shadows, specular highlights on wet skin and water ripples, slight chromatic aberration on water droplets."
},
"camera_specs": {
"gear_lens": "Phase One XF IQ4 150MP, 28mm wide-angle prime lens.",
"aperture_iso": "f/11 for deep depth of field, ISO 50, 1/4000s shutter speed to freeze water.",
"film_finish": "Kodak Ektar 100 simulation, high contrast, saturated blues and golds, ultra-sharp focus."
}
}
{
"meta": {
"type": "photorealistic_image_generation",
"style": "8k raw photo, hyper-detailed, masterpiece"
},
"subject": {
"appearance": "Fit young woman, natural fair skin, wet hair slicked back.",
"pose": "Standing on wooden boat edge, arms raised with hands behind head adjusting hair. Looking downward, subtle pout. One leg straight, other bent at knee on gunwale.",
"focus": "Full body shot, defined abdominal muscles, hourglass figure."
},
"attire": {
"garment": "Mismatched bikini swimwear.",
"details": "Top: Left cup features a geometric pattern in light blue, dark green, and dark navy; Right cup features a geometric pattern in bright yellow, orange, and red. Bottoms: Matching abstract overlapping diamond pattern incorporating all six colors (light blue, dark green, dark navy, yellow, orange, red) with side ties.",
"accessories": "Black, thick-rimmed oval sunglasses."
},
"environment": {
"location": "Mediterranean coastal scene, daytime.",
"background": "Towering textured limestone cliffs, sparse green shrubbery. Deep blue, rippled ocean.",
"elements": "Distant small white tourist boats anchored near cliff base. Wooden boat railing in foreground."
},
"lighting": {
"source": "Natural daylight.",
"quality": "Soft, diffused sunlight creating gentle torso shadows; highlights skin texture and water wetness."
},
"camera_specs": {
"gear": "Sony A7R IV, 35mm lens.",
"settings": "f/2.8 aperture for background bokeh, fast shutter speed.",
"textures": "High fidelity skin, realistic water droplets, detailed rock formations."
}
}
{
"style": "ultra-realistic studio portrait",
"subject": {
"gender": "female",
"age": "young adult",
"pose": "leaning slightly forward toward the camera",
"expression": "playful, flirty",
"facial_details": {
"wink": true,
"tongue_out": true,
"freckles": "natural across fair skin",
"makeup": {
"blush": "soft pink",
"lips": "glossy"
}
},
"hair": {
"color": "redhair",
"length": "long",
"part": "side-parted",
"style": "falling naturally over shoulders"
},
"outfit": {
"dress": "off-shoulder fitted black dress",
"jewelry": {
"earrings": "long dangling gold earrings",
"necklaces": "layered gold necklaces with small heart pendant"
}
}
},
"environment": {
"setting": "studio",
"background": "clean minimal light neutral tones"
},
"lighting": {
"type": "soft diffused studio lighting",
"shadows": "smooth natural shadows"
},
"camera": {
"lens": "50mm",
"aperture": "f/1.8",
"depth_of_field": "shallow"
},
"quality": {
"resolution": "high resolution",
"detail": "ultra-detailed",
"skin_texture": "photorealistic",
"focus": "sharp focus",
"photography_style": "high fashion lifestyle photography"
}
}About the Provider
Google's Gemini models represent the company's latest work in AI built for creative and technical workflows — powerful tools designed to stay out of your way and let you move faster.
Related models
Gemini 2.5
Gemini 2.5 Edit (aka NanoBanana) by Google enables rapid, text-based photo modifications and background adjustments from 7 credits.
Veo 3.1 (Fast)
Google's Veo 3.1 (Fast) is a high-speed variant of Veo 3.1, offering its advanced features and multiple input types for rapid, cost-effective prototyping.
Veo 3.1
Google's Veo 3.1 is a state-of-the-art model for 1080p video, offering superior prompt adherence and advanced controls like start/end/reference frames.






